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FZEZfD OF TBS ZNVBNTION 

The present invention relates to heterologous fusion 
proteins, including analogs and derivatives thereof, fused 
to proteins that have the effect of extending the in vivo 
half -life of the proteins* These fusion proteins are 
significant in human medicine, particularly in the treatment 
of conditions treatable by stimulation of circulating 
neutrophils, such as after chemotherapy regimens or in 
chronic congenital neutropenia. More specifically, the 
invention relates to novel heterologous fusion proteins with 
granulocyte-colony stimulating factor activity- 

BACKGROUMD OF THE INVEiKPIOH 
Among all blood cell lineages, the modulation of 
neutrophil and platelet production has be^ of highest 
interest to clinical oncologists and hematologists . 
Myelosuppression is the single most severe complication of 
cancer chemotherapy, and a major cause of treatment delay 
during multiple-cycle or combination chCTio therapy . It is 
also the major dose-limiting factor for most 
chemotherapeutic agents. Due to the short half- lives of 
neutrophils in peripheral blood, life-threatening falls in 
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neutrophil levels are seen after a mjniber of conventioxial 
anti-tumor chemotherapy regimens* 

The most prominent regulator of granulopoiesis is 
5 sniranulocyte- colony stimulating factor (G-CSF) . 6-CSF 

induces proliferation and differentiation of hematopoietic 
progenitor cells resulting in increased nxonibers of 
circulating neutrophils, G-CSF also stimulates the release 
of mature neutrophils from bone marrow and activates their 
10 functional state, [Souza L»M., et al. (1986) Science 

232:61-55]. Thus, therapeutic proteins with G-CSF activity 
^ have tremendous value in situations where there are reduced 

y circulating levels of neutrophilic granuloctyes, 

^ However, the usefulness of therapy using G-CSF peptides 

15 has been limited by their short plasma half-life. Thus, 
^ they must be administered intravenously or sxibcutaneously at 

^ fairly frequent intervals (once or twice a day) in order to 

maintain their neutrophil stimulating properties. In 
^ addition, this short half-life limits the performance of the 

l,n 20 drug to traditional drug delivery systems. It would clearly 
benefit the treatment of patients with abnormally low 
neutrophils, and reduce the discomfort and inconvenience 
associated with frequent injections to provide a 
pharmaceutical agent that could be administered less 
25 frequently and optionally by alternative routes of 

administration. Thus, a need exists to develop agents that 
stimulate the production of mature neutrophils aund are more 
optimal in their duration of effect. 

The present invention overcomes the problems associated 
30 with delivering a compound that has a short plasma half -life 
in two respects. First, 6-CSF is hyperglycosylated. fPhe 
carbohydrate content of 6-CSF is altered by substituting 
amino acids that can act as substrates for glycosylating 
enzymes in mammalian cells. Mbst significantly, the present 
35 invention encompasses fusion of these hyperglycosylated 6- 
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CSF analogs to another protein with a long circulating half- 
life such as the Pc portion of an immunoglobulin or albumin. 

BRIEF SmMftRY OF THE mVEMTION 

5 Conipounds of the present invention include heterologous 

fusion proteins comprising a hyperglycosylated G-CSP analog 
fused to a polypeptide selected from the group consisting of 

a) hiiman albumin; 

b) human albumin analogs; and 
10 c) fragments of hvmian albumin. 

Additional coitipounds of the present invention include a 
^ heterologous fusion protein comprising a hyperglycosylated 

W G-CSF analog fused to a polypeptide selected from the group 

P consisting of 

^15 a) the Fc portion of an immunoglobulin; 

^ b) an analog of the Fc portion of an immunoglobulin; 

^ and 

S c) fragments of the Fc portion of an immxinoglobulin. 

y The hyperglycosylated G-CSF analog may be fused to the 

Ifl 20 polypeptide via a peptide linker. It is preferable that the 
ffi peptide linker is selected from the group consisting of: 

a) a glycine rich peptide; 

b) a peptide having the sequence [Gly-Gly-Gly-Gly- 
Ser]n where n is 1, 2, 3, 4, or 5; and 

25 c) a peptide having the sequence [Gly-61y-Gly-Gly-Ser]3 



One aspect of the present invention includes 
heterologous fusion proteins, wherein the hyperglycosylated 
30 6-CSF analogs have the Formula (I) [SEQ ID N0:1] 

•1 5 10 15 

Thr Pro lieu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
20 25 30 

35 Xaa Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin 
35 40 45 

Glu Lys Leu Cys Xaa Xaa Xaa Lys Leu Cys His Pro Glu Glu Leu Val 

50 55 60 

Leu Leu Gly His Ser Leu Gly He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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65 70 75 80 

Xaa Xaa Xaa Xaa Xaa Gin Leu Ala Gly Cys I*eu Ser Gin Leu Hxs Ser 

85 90 '55 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu xaa Xaa Xaa Ser 

100 105 110 

Xaa Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 

115 120 125 

Phe Ala Thr Oftir lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

130 135 140 

Ala Leu Gin Pro Xaa Xaa Xaa Ala Met Pro Ala Phe Xaa Xaa Xaa Phe 
145 150 155 160 

Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 

165 170 
Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro (I) 



wherein : 

Xaa at position 17 is Cys, Ala, Leu, Ser, or Glu; 
Xaa at position 37 is Ala or Asn; 

Xaa at position 38 is Thr, or any other amino acid except 
Pro; 

Xaa at position 39 is Tyr, Thr, or Ser; 
Xaa at position 57 is Pro or Val; 
Xaa at position 58 is Trp or Asn; 

Xaa at position 59 is Ala or any other amino acid except 
Pro; 

Xaa at position 60 is Pro, Thr, Asn, or Ser, 

Xaa at position 61 is Leu, or any other amino acid except 

Pro; 

Xaa at position 62 is Ser or Thr; 
Xaa at position 63 is Ser or Asn; 

Xaa at position 64 is Cys or any other amino acid except 
Pro; 

Xaa at position 65 is Pro, Ser, or Thr; 
Xaa at position 66 is Ser or Thr; 
Xaa at position 67 is Gin or Asn; 

Xaa at position 68 is Ala or any other aiaino acid except 
Pro; 

Xaa at position 69 is Leu, Thr, or Ser 
Xaa at position 93 is Glu or Asn 

Xaa at position 94 is Gly or any other amino acid except 
Pro; 

Xaa at position 95 is He, Asn, Ser, or Thr; 
• Xaa at position 97 is Pro, Ser, Thr, or Asn; 
xaa at position 133 is Thr or Asn; 

Xaa at position 134 is Gin or any other amino acid except 
Pro; 

Xaa at position 135 is Gly, Ser, or Thr 
Xaa at position 141 is Ala or Asn; 

Xaa at position 142 is Ser or any other amino acid except 
Pro; and 
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Xaa at position 143 is Ala, Ser, or Ihr; 



and 


wherein : 














Xaa 


at positions 


37, 


38, 


and 


39 


constitute region 


1; 


Xaa 


at positions 


58, 


59, 


and 


60 


constitute region 


2; 


Xaa 


at positions 


59, 


60, 


and 


61 


constitute region 


3; 


Xaa 


at positions 


60, 


61, 


and 


62 


constitute region 


4; 


xaa 


at positions 


61, 


62, 


and 


63 


constitute region 


5; 


Xaa 


at positions 


62, 


63, 


and 


64 


constitute region 


6; 


Xaa 


at positions 


63, 


64, 


and 


65 


constitute region 


7; 


Xaa 


at positions 


64, 


65, 


and 


66 


constitute region 


8; 


Xaa 


at positions 


67, 


68, 


and 


69 


constitute region 


9; 


Xaa 


at positiozis 


93, 


94, 


and 


95 


constitute region 


10; 




at positions 


94, 


95, 


and 


Ser at position 96 





constitute region 11; 



Xaa at positions 95, and 97, and Ser at position 96 

constitute region 12; 
Xaa at positions 133, 134, and 135 constitute 

region 13; 

Xaa at positions 141, 142, and 143 constitute 
region 14; 

and provided that at least one of regions 1 through 14 
conqprises the sequence Asn Xaal Xaa2 wherein Xaal is any 
amino acid except Pro and Xaa2 is Ser or Thr. 

Thus, the heterologous fusion proteins of the present 
invention include analogs wherein one or any coxnbination of 
two or more regions comprise the sequence Asn Xaal Xaa2 
wherein Xaal is any amino acid except Pro and Xaa2 is Ser or 
Thr. 

Preferred hyperglycosylated G-CSF analogs that make up 
part of the heterologous fusion proteins of the present 
invention, include the following: 
a) G-CSF [A37N,Y39T] 
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b) O-CSP [P57V, W58N, P60T] 

c) G-CSF[F60N,S62T] 

d) G-CSP[S63N,P65T] 

e) G-CSP[Q67N,L69T] 
5 f) G-CSP[E93N, I95T] 

g) G-CSF[T133N,G135T] 

h) G-CSF[A141N,A143T] 

. i) G-CSFtA37N,Y39T,P57V,W58N,P60T] 
j) G-CSF[A37N,Y39T,P60N,S62T] 
10 k) G-CSF[A37N,Y39T,S63N,P65T] 
^ 1) G-CSP[A37N,Y39T,Q67N,L69T] 

W m) G-CSP[A37N,Y39T,E931!J, I95T] 

n) G-CSF[A37N,Y39T,T133N,G135T] 
^ o) G-CSF[A37N,Y39T,A141N,A143T] 

15 p) G-CSF[A37N,Y39T,P57V,W58N, P60T,S63N,P65T] 
^ q) G-CSF[A37N,Y39T,P57V,W58N,P60T,Q67N,Ij69T] 

W r) G-CSF[A37N.Y39T,S63N,P65T,E93N,I95T] 

Ifl The present invention also includes heterologous fusion 

^ proteins, which are the product of the esqpression in a host 

20 cell o£ an exogenous DNA sequence, which contprises a DNA 

sequence encoding a heterologous fusion protein of Fonnula X 
(described above) fused to a DNA sequence encoding human 
albumin or the Fc portion of an immunolglobulin. 

A hyperglcosylated heterologous fusion protein of the 
25 present invention also includes polynucleotides encoding the 
heterologous fusion protein described herein, vectors 
conqprising these polynucleotides and host cells transfected 
or transformed with the vectors described herein. Also 
included is a process for producing a heterologous fusion . 
30 protein comprising the steps of transcribing and translating 
a polynucleotide described herein under conditions vritierein 
the heterologous fusion protein is expressed in detectable 
amoiints . 
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The present invention encoinpasses a method for 
increasing neutrophil levels in a mammal cozcs>rising the 
administration of a therapeutically effective amount of a 
heterologous fusion protein described above. The present 
invention also includes the use of the heterologous fusion 
proteins described above for the manufacture of a medicament 
for the treatment of patients with insufficient circulating 
neutrophil levels. 

The present invention also encompasses a pharmaceutical 
formulation adapted for the treatment of patients with 
insufficient neutrophil levels coniprising a glycosylated 
protein as described above. 

BRIEF DgSCRIPTIOW OF THE MtAWIWGS 

The invention is further illustrated with reference to 
the following drawings: 

Figure 1; Schematic illustrating foiirteen regions in 
human G-CSF wherein the amino acid sequence can be mutated 
to create functional glycosylation sites. 

PETAXItEP PESCRIFTIOW OF THE IWVEMTION 

The present invention coxnprises heterologous fusion 
proteins. As used herein, the term heterologous fusion 
protein means a hyperglycosylated G-CSF analog fused to 
human albumin, a human albumin analog, a human albumin 
fragment, the Fc portion of an immunoglobulin, an analog of 
the Fc portion of an immunoglobulin, or a fragment of the Fc 
portion of an immunoglobulin. The G-CSF analog may be fused 
directly, or fused via a peptide linker, to an albumin or Fc 
protein. The albumin and Fc portion may be fused to the G- 
CSF analogs at either terminus or at both termini. These 
heterologous fusion proteins are biologically active and 
have an increased half -life coxjpared to native G-CSF. 
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Hyperglycosylated G-CSF Analogs 

Hyperglycosylated analogs of Q-CSF as used herein 
refers to analogs of G-CSF with one or more changes in the 
axhino acid sequence which result in an increase in the 
number of sites for carbohydrate attachment coziipared with 
native human G-CSF expressed in animal cells ±n vivo. In 
addition, hyperglycosylated G-CSF analogs include htraian G- 
CSF wherein the 0-linked glycosylation site at position 133 
is replaced with an N-linked glycosylation site. Analogs 
are generated by site directed mutagenesis having 
siibstitution of amino acid residues creating new sites that 
are available for glycosylation. G-CSF analogs having a 
greater carbohydrate content than that found in native human 
G-CSF are generated by adding glycosylation sites that do 
not perturb the secondary, tertiary, and quaternary 
structure required for activity. Ihe heterologous fusion 
proteins of the present invention thus have a larger mass 
and an increased negative charge compeared to native G-CSF, 
they will not be as rapidly cleared from the circulation. 

It is preferred that the hyperglycosylated G-CSF analog 
portion of the heterologous fusion protein have 1, 2, 3, or 
4 additional sites for N-glycosylation. Figure 1 
illustrates fourteen different regions that can be 
glycosylated with very little effect on in vitro activity. 
Each region may be mutated to the consensus site for N- 
glycosylation addition which is Asn XI X2 wherein XI is any 
amino acid except Pro and X2 is Ser or Thr. It is preferred 
that the XI amino acid be any other amino acid except Trp, 
Asp, Glu, or Leu and it is most preferred that the XI amino 
acid be the naturally occurring amino acid. The scope of 
the present invention includes analogs wherein a single 
region (1 through 14) is mutated or wherein a region is 
mutated in coinbination with one or more other regions. 

A representative niunber of fusion proteins having 
single of multiple additional glycosylation sites have been 
expressed, purified, and characterized. The mutated amino 
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acids in the hyperglycosylated G-CSF portion of the 
heterologous fusion protein are identified in parentheses. 
For exaitrple, G-CSF[A37N, Y39T] is 6-CSF wherein the amino 
acids at positions 37 and 39 have been substituted to create 
5 a glycosylation site. This site of carbohydrate attachment 
is illustrated as region 1 in Figure 1. G- 
C:SF[A37N,Y39T,P57V,W58N,P60T] is an example of a G-CSF 
analog wherein amino acids in region 1 and region 2 are 
mutated to provide two functional glycosylation sites on a 
10 single molecule (Figure 1) . 

G-CSF [A3 7N,y39T,P57V,W58N,P60T,Q67N,L69T] is an example 
of a G-CSF analog wherein the amino acids in region 1, 
region 2, and region 9 are mutated to provide three 
functional glycosylation sites on a single molecule (Figure 

<f Native G-CSF can be used as the backbone to create the 

^ hyperglycosylated G-CSF analog portion of a particular 

gj heterologous fusion protein. In addition, the native G-CSF 

W backbone can be modified such that sxabstitutions in the 

II 20 regions defined in Figure 1 are made in the context of a 

0 different or improved heterologous fusion protein. For 

exaicple, native G-CSF with a Cysteine to Alanine 

substitution at position 17 may reduce aggregation and 

enhance stability and thus, can be used as the backbone used 

25 to create the hyperglycosylated G-CSF analog portion of the 

heterologous fusion proteins of the present invention. 

In addition, Reidhaar-Olson, et al., through alanine 

scanning mutagenesis, describe residues critical to the 

activity of human G-CSF* [Reidhaar-Olson, et al. (1996) 

30 Biochemistiy 35:9034-9041; See also Young, et al. (1997) 

Protein Science 6:1228-1236]. Thus, the hyperglycosylated 

G-CSF analogs can be modified by substituting amino acids 

outside the glycosylated regions described in Figure 1. 
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The heterologous fusion proteins of the present 
Invention also encoinpass G-CSF analogs wherein the O-linked 
glycosylation site at position 133 is imitated to serve as an 
N-linked glycosylation site. The N-linked carbohydrate will 
generally have a higher sialic acid content which will 
protect it from the rapid clearance laechanisxns associated 
with native G-CSP* 

The functions of a carbohydrate chain greatly depend on 
the structure of the attached carbohydrate moiety. 
Typically coinpounds with a higher sialic acid content will 
have better stability and longer half -lives in vivo. The N- 
linked oligosaccharides contain sialic acid in both an a2,3 
and an a2,6 linkage to galactose. [Takeuchi, et al. (1988) 
J". Biol. Cheat. 263:3657]. Typically the sialic acid in the 
a2,3 linkage is added to galactose on the mannose al,6 
branch and the sialic acid in the a2,6 linkage is added to 
the galactose on the mannose (Xl,3 branch. The enzymes that 
add these sialic acids (p-galactoside a2,3 sialyltransf erase 
and P-galactoside a2,6 sialyltransf erase) are most efficeint 
at adding sialic acid to the mannose al,6 and mannose al,3 
branches respectively. 

Tetra-antennary N- linked oligosachharides most commonly 
provide four possible sites for sialic acid attachment while 
bi- and tri-antennary oligosaccharide chains, which can 
substitute for the tetra-antennary form at Asn-liked sites, 
commonly have at most only two or three sialic acids 
attached. O-linked oligosaccharides commonly provide only 
two sites for sialic acid attachement. Mammalian cell 
cultures can be screened for those cells that preferentially 
add teta-antennary chains to the G-CSF analogs of the 
present invention, thereby maximizing the noimber of sites 
for sialic acid attachment. Different types of mammalian 
cells also differ with respect to the transferase enzymes 
present and consequently the sialic acid content and type of 
oligosachharide attached at each site. One way to optimize 
the carbohydrate content for a given G-CSF analog is to 




express the analog in a cell line wherein an expression 
plasmid containing DNA encoding a specific sialyl 
transferase (e.g., a2,6 sialyltrasnf erase) is co-transfected 
with the G-CSF analog expression plasmid. 
5 Alternatively a host cell line may be stably transfected 
with a sialyl transferase cDNA and that host cell used to 
express the G-CSP analog of interest. Thus, it is 
preferable if the oligosaccharide structure and sialic acid 
content are optimized for each analog encompassed by the 
10 present invention. 



ru 



^J* Heterologous Fc fusion proteins: 

y Mie hyperglycosylated G-CSF analogs described above can 

fil be fused directly or via a peptide linker to the Fc portion 

^15 of an immunoglobulin. For example SEQ ID NO: 23 represents 
^ the sequence of a G-CSF-IgGl Fc fusion protein, SEQ ID NO: 24 

^ represents the sequence of a G-CSF-IgG4 Fc fusion protein, 

P and SEQ ID NO: 25 represents the sequence of a G-CSF-h\mian 

W albxHtiin fusion protein, 

yi 20 Immunoglobulins are molecules containing polypeptide 

§ chains held together by disulfide bonds typically having two 

light chains and two heavy chains. In each chain, one 
domain (V) has a variable amino acid sequence depending on 
the antibody specificity of the molecule. The other domains 
25 (C) have a rather constant sequence common to molecules of 
the same class. 

As used herein, the Fc portion of an immunoglobulin has 
the meaning commonly given to the teinn in the field of 
immunology. Specifically, this tenti refers to an antibody 
30 fragment which is obtained by removing the two antigen 
binding regions (the Fab fragments) from the antibody. 
Thus, the Fc portion is formed from approximately equal 
sized fragments of the constant region from both heavy 
chains, which associate through non-covalent interactions 
35 and disulfide bonds. The Fc portion can include the hinge 

regions and extend through the CH2 and CH3 domains to the C- 
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tertninus of the antibody. For example, SEQ ID NOs: 18 and 19 
represent the sequence of a Fc region encontpassing the hinge 
region as well as the CH2 and CH3 domains. Other 
representative hinge regions for human and mouse 
5 immunoglobulins can be found in Antibody Engineering, A 
Practical Guide, Borrebaeck, C.A.K., ed., W.H. Freeman and 
Co., 1992, the teachings of which are herein incorporated by 
reference . 

There are five types of human immxinoglobulin Fc regions 
10 with different effector and pharmacokinetic properties: IgG, 
IgA, IgM, IgD, and IgE, IgG is the most abundant 
^ immunoglobulin in serum, IgG also has the longest half-life 

y in serum of any immunoglobulin (23 days) . Unlike other 

p immunoglobulins, IgG is efficiently recirculated following 

[| 15 binding to an Fc receptor. There are four IgG subclasses 
Gl, Q2, G3, and G4, each of which have different effector 
fxmctions. . Gl, G2, and G3 can bind Clq and fix complement 
g3 while G4 cannot. Even though G3 is able to bind Clq more 

W efficiently than Gl, Gl is more effective at mediating 

Ijfj 20 contplement -directed cell lysis- G2 fixes complement very 
inefficiently. The Clq binding site in IgG is located at 
the carboxy terminal region of the CH2 domain. 

All IgG s\ibclasses are capable of binding to Fc 
receptors (CD16, CD32, CD64> with Gl and G3 being more 
25 effective than G2 and G4. The Fc receptor-binding region of 
IgG is formed by residues located in both the hinge and the 
carboxy terminal regions of the CH2 domain. 

IgA can exist both in a monomeric and dimeric form held 
together by a J-chain, IgA is the second most abundant Ig 
30 in serum, but it has a half -life of only 6 days. IgA has 
three effector f\inctions. It binds to an IgA specific 
receptor on macrophages and eosinophils, which drives 
phagocytosis and degranulation , respectively. It can also 
fix complement via an luiknown alternative pathway. 
35 IgM is expressed as either a pentamer or a hexamer, 

both of which are held together by a J-chain. XgM has a 
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senim half -life of 5 days. It binds weakly to Clq via a 
'binding site located in its CH3 domain. IgD has a half -life 
of 3 days in serum. It is unclear what effector fimctions 
are attributable to this Ig. igB is a monomeric Ig and has 
5 a serum half^life of 2.5 days. IgE binds to two Fc 

receptors which drives degranulation and results in the 
release of pro-inflammatory agents. 

Depending on the desired in vivo effect, the 
heterologous fusion proteins of the present invention may 
10 contain any of the isotypes described above or may contain 

mutated Fc regions wherein the complement and/or Fc receptor 

^ binding functions have been altered. For example, one 

£3 

y embodiment of the present invention is a heterologous fusion 

& protein wherein the Fc portion comprises a human Ig64 

15 sequence wherein serine at position 229 is changed to 
J proline which reduces the effector function of the protein. 

^ (See SEQ ID NO: 22). 

^ The heterologous fusion proteins of the present 

W invention may contain the entire Fc portion of an 

y3 20 immunoglobulin, fragments of the Fc portion of an 

immunoglobulin, or analogs thereof fused to a G-CSF analog. 
Furthermore, the Fc portion may be fused at either terminus 
or at both termini. 

The heterologous fusion proteins of the present 
25 invention can consist of single chain proteins or as multi- 
chain polypeptides. Two or more Fc fusion proteins can be 
produced such that they interact through disulfide bonds 
that naturally form between Fc regions. These multimers can 
be homogeneous with respect to the 6-CSF analog or they may 
30 contain different G-CSF analogs fused at the N-terminus of 
the Fc portion of the fusion protein. 

Regardless of the final structiire of the fusion 
protein, the Fc or Fc-like region serves to prolong the in 
vivo plasma half- life of the G-CSF analog coitpared to native 
35 6-CSF. Furthermore, the fused G-CSF analog must retain some 
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biological activity. Biological activity can be determined 
by in vitro and in vivo methods known in the art. 

Since the Fc region of IgG produced by proteolysis has 
the same in vivo half-life as the intact IgG molecule and 
Fab fragments are rapidly degraded, it is believed that the 
relevant sequence for prolonging half-life resides in the 
CH2 and/ or CH3 domains. Further, it has been shown in the 
literature that the catabolic rates of IgG variants that do 
not bind the high-af f xnity Fc receptor or Clq are 
indistinguishable from the rate of clearance of the parent 
wild-type antibody, indicating that the catabolic site is 
distinct from the sites involved in Fc receptor or Clq 
binding. [Wawrzynczak et al., (1992) Molecular Imaunology 
29:221]. Site-directed mutagenesis studies using a murine 
IgGl Fc region suggested that the site of the igGl Fc region 
that controls the catabolic rate is located at the CH2-CH3 
domain interface. 

Based on these studies, Fc regions can be modified at 
the catabolic site to optimize the half-life of the fusion 
proteins. It is preferable that the Fc region used for the 
heterologous fusion proteins of the present invention be 
derived from an IgGl such as that represented by SEQ ID NO: 
21 or an IgG4 Fc region such as that represented by SEQ ID 
NO: 22. It is even more preferable that the Fc region be 
IgG4 or derived from IgG4. Preferably the IgG Fc region 
contains both the CH2 and CHS regions including the hinge 
region . 

Heterologous albumin fusion ^>roteins: 

The G-CSF analogs described above can be fused directly 
or via a peptide linker to albumin or an analog, fragment, 
or derivative thereof. SEQ ID NO: 25 provides a 
representative G-CSF-human albumin fusion protein. 

Generally the albumin proteins making up part of the 
heterologous fusion proteins of the present invention can be 
derived from albumin cloned from any species. However, 



P-15648 



ru 



15 



human albxjmin and fragments and analogs thereof are 
preferred to reduce the risk of the fusion protein being 
Immunogenic In hxomans. Human serum albumin (HA) consists of 
a single non-glycosylated polypeptide chain of 585 amino 
5 acids with a formula molecular weight of 66,500. The amlno^ 
acid sequence of human albtimln Is represented as SEQ ID NO: 
20. [See Meloxin, et aJ. (1975) PEBS Letters 58:136; 
Behrens/ et al. (1975) Fed. Proc. 34:591; Lawn, et al. 
(1981) Nucleic Acids Research 9:6102-6114; Minghetti/ et al. 
10 (1986) J. Biol. Chem. 261:6747]. A variety of polymorphic 
variants as well as analogs and fragments of albumin have 
^ been described. [See Weitkainp/ et al., (1973) Ann. Hum. 

y Genet. 37:219]. For exanqple, in EP 322,094, the Inventors 

@1 disclose various shorter forms of HA. Some of these 

15 fragments include HA(l-373), HA(l-388), HA(l-389), HA(1- 
369), and HA(1-419) and fragments between 1-369 and 1-419. 
® EP 399,666 discloses albumin fragments that Include HA(1- 

g 177) and HA(l-200) and fragments between HA(1-177) and HA(1- 

W 200) . 

^20 It is understood that the heterologous fusion proteins 

^ of the present invention include hyperglycosylated G-CSF 

analogs that are coupled to any albumin protein including 
fragments, analogs, and derivatives wherein such fusion 
protein is biologically active and has a longer plasma half- 
25 life than native G-CSF. Thus, the albumin portion of the 

fusion protein need not necessarily have a plasma half-life 
equal to that of native h\aman albumin. In addition, the 
albumin may be fused to either terminus or both termini of 
the hyperglycosylated G-CSF analog. 
30 The heterologous fusion proteins of the present 

invention encompass proteins having conservative amino acid 
s\ibsti tut ions in the hyperglycosylated G-CSF analog and/ or 
the Fc or albumin portion of the fusion protein. 

The term "^amino acid'' is used herein in Its broadest 
35 sense, and Includes naturally occiirring amino acids as well 
as non-naturally occurring amino acids, including amino acid 



P-15648 



16 

analogs and derivatives. The latter includes molecules 
containing an amino acid moiety. One skilled in the art 
will recognize, in view of this broad definition, that 
reference herein to an amino acid includes, for example, 
naturally occurring proteogenic L-amino acids; D-amino 
acids; chemically modified amino acids such as amino acid 
analogs and derivatives; naturally occurring non-proteogenic 
amino acids such as nor leucine, p-alanine, ornithine, GABA, 
etc.; and chemically synthesized compoionds having properties 
known in the art to be characteristic of amino acids. As 
used herein, the term ^'proteogenic" indicates that the amino 
acid can be incorporated into a peptide, polypeptide, or 
protein in a cell through a metabolic pathway. 

The incorporation of non-natural amino acids, including 
synthetic non-native amino acids, sxabstituted amino acids, 
or one or more D-amino acids into the heterologous fusion 
proteins of the present invention can be advantageous in a 
number of different ways. D-amino acid-containing peptides, 
etc., exhibit increased stability in vitro or in vivo 
contpared to L-amino acid-containing counterparts. Thus, the 
construction of peptides, etc., incorporating D-amino acids 
can be particularly useful when greater intracellular 
stability is desired or required. More specifically, D- 
peptides, etc., are resistant to endogenous peptidases and 
proteases, thereby providing improved bioavailability of the 
molecule, and prolonged lifetimes in vivo when such 
properties are desirable. Additionally, D-peptides, etc., 
cannot be processed efficiently for major histocompatibility 
complex class Il-restricted presentation to T helper cells, 
and are therefore less likely to induce hiomoral immune 
responses in the whole organism. 

Although the heterologous fusion proteins of the 
present invention can be made by a variety of different 
methods, recombinant methods are preferred. 



P-15648 



17 

Construction of DNA encoding the heterologous fusion 
proteins of the present invention: 

Wild type albxunin and immunoglobulin proteins can be 
obtained from a variety of sources. For exanple, these 
proteins can be obtained from a cDNA librany prepared from 
tissue or cells which e^^ress the iniRNA of interest at a 
detectable level. Libraries can be screened with probes 
designed using the pxiblished DNA or protein sequence for the 
particular protein of interest. 

Screening a cDNA or genomic library with the selected 
probe may be conducted using standard procedures, such as 
described in Sambrook et al,. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
(1989) . An alternative means to isolate a gene encoding an 
albvimin or immunoglobulin protein is to use PGR methodology 
[Sambrook et al,, supra; Dieffenbach et al., PCJR Primer: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
(1995)] . PGR primers can be designed based on published 
sequences • 

Generally the full-length wild- type sequences cloned 
from a particular species can serve as a template to create 
analogs, fragments, and derivatives that retain the ability 
to confer a longer plasma half-life on the G-CSF analog that 
is part of the fusion protein. It is preferred that the Fc 
and albumin portions of the heterologous fusion proteins of 
the present invention be derived from the native human 
sequence in order to reduce the risk of potential 
immunogenicity of the fusion protein in humans. 

In particular, it is preferred that the immxinoglobulin 
portion of a fusion protein encompassed by the present 
invention contain only an Fc fragment of the immunoglobulin. 
Depending on whether particular effector functions are 
desired and the structural characteristics of the fusion 
protein, an Fc fragment may contain the hinge region along 
with the CH2 and CH3 domains or some other coziibination 
thereof. These Fc fragments can be generated using PGR 
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techniques with primers designed to hybridize to sequences 
corresponding to the desired ends of the fragment. 
Similarly, if fragments of albvimin are desired, PGR primers 
can be designed which are coxnplementary to internal albumin 
sequences. PGR primers can also be designed to create 
restriction enzyme sites to facilitate cloning into 
expression vectors. 

DHA encoding hiiman G-CSF can be obtained from a cDNA 
library prepared from tissue or cells which express G-CSF 
mRNA at a detectable level such as monocytes, macrophages, 
vascular endothelial cells, fibroblasts, and some hioman 
malignant and leukemic myeloblastic cells. Libraries can be 
screened with probes designed using the published DNA 
sequence for human G-CSP. [Souza L., efc al. (1986) fi^cience 
232:61-65]. Screening a cDNA or genomic library with the 
selected probe may be conducted using standard procedures, 
such as described in Sambrook et al., iSfoIecuIar Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, NY 
(1989) . An alternative means to isolate the gene encoding 
human G-CSF is to use PGR methodology [Sambrook, et al., 
supra; Dieffenbach, et al., PCR Primer: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, NY (1995)]. 

The glycosylated G-CSF analogs of the present 
invention can be constiructed by a variety of mutagenesis 
techniques well known in the art. Specifically, a 
representative number of glycosylated G-CSF analogs were 
constructed using mutagenic PCR from a cloned wild-type 
h\iman G-CSF DNA template (Example 1) . 

The glycosylated G-CSF analogs of the present invention 
may be produced by other methods including recombinant DNA 
teclmology or well known chemical procedures, such as solution 
or solid-phase peptide synthesis, or semi -synthesis in solution 
beginning with protein fragments coupled through conventional 
solution methods. 

Recoinbinant DNA methods are preferred for producing the 
glycosylated G-CSF analogs of the present invention. Host 
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cells are transf ected or transformed with expression or 
cloning vectors described herein for glycosylated G-CSP 
analog production and cultured in conventional nutrient 
media modified as appropriate for inducing promoters, 
5 selecting transf orxnants, or anqplifying the genes encoding 
the desired sequences (Example 2) . The culture conditions, 
such as media, temperature, pH and the like, can be selected 
by the skilled artisan without undue experimentation 
Physical stability is an essential feature for 
10 therapeutic formulations. The physical stability of the 
heterologous fusion proteins of the present invention 
depends on their conformational stability, the nxmiber of 

S3 

y charged residues (pi of the protein) , the ionic strength and 

^ - pE of the formulation, and the protein concentration, among 
J 15 other possible factors. As discussed previously, the G-CSP 

analog portion of the heterologous fusion proteins can be 
^ successfully glycosylated and expressed such that the three 

Q dimensional structiire is maintained. Because these 

W heterologous fusion proteins are able to fold properly in a 

* 20 hyperglycosylated state, they will have improved 



conformational and physical stability relative to wild- type 
G-CSF. 

While wild-type 6-CSF produced in mammalian cells and 
bacterial cells has similar activity in vivo, the mammalian 

25 cell-produced protein has increased conformational and 

physical stability due to the presence of a single O-linked 
sugar moiety present at position 133. Hius, the G-CSF 
analog portion of the heterologus fusion proteins, which has 
an increased glycosylation content compared to wild-type G- 

30 CSF produced in mammalian or bacterial cells, will confer 
increased stability on the heterologous fusion protein. 
Furthermore, it is likely that glycosylation may inhibit 
inter-domain interactions and consequently enhance stability 
by preventing inter-domain disulfide shuffling. 

35 The gene encoding a heterologous fusion protein can be 

constructed by ligating DNA encoding a G-CSF analog in-frame 
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to DNA ^encoding an albumin or Fc protein. The gene encoding 
the G-CSF analog and the gene encoding the albtomin or Fc 
protein can also be joined in- frame via DNA encoding a 
linker peptide. 

5 The In vivo function and stability of the heterologous 

fusion proteins of the present invention can be optimized by 
adding small peptide linkers to prevent potentially unwanted 
domain interactions. Although these linkers can potentially 
be any length and consist of any combination of amino acids, 
10 it is preferred that the length be no longer than necessary 
to prevent unwanted domain interactions and/or optimize 
biological activity and/or stability. Generally, the 
linkers should not contain amino acids with extremely bulky 
side chains or amino acids likely to introduce significant 
15 secondary structure. It is preferred that the linker be 
^IP serine-glycine rich and be less than 30 amino acids in 

W length. It is more preferred that the linker be no more 

#g than 20 amino acids in length. It is even more preferred 

W that the linker be no more than 15 amino acids in length. A 

20 preferred linker contains repeats of the sequence Gly-Gly- 
0 Gly-Gly-Ser. It is preferred that there be between 2 and 6 

^ repeats of this sequence. It is even more preferred that 

there be between 3 and 4 repeats of this sequence. 

To construct the heterolgous G-CSF fusion proteins, the 
25 DNA encoding wild- type 6-CSF, albumin, and Fc polypeptides 
and fragments thereof can be mutated either before ligation 
or in the context of a cDNA encoding an entire fusion 
protein. A variety of mutagenesis technic[ues are well known 
in the art. For example, a mutagenic PGR method utilizes 
30 strand overlap extension to create specific base mutations 
for the purposes of changing a specific amino acid sequence 
in the corresponding protein. This PGR xmitagenesis requires 
the use of four primers, two in the forward orientation 
(primers A and C) and two in the reverse orientation 
35 (primers B and D) . A mutated gene is amplified from the 
wild-type template in two different stages. The first 



P-15648 



21 

reaction amplifies the gene in halves by perfonaing an A to 
B reaction and a separate C to D reaction wherein the B and 
C primers target the area of the gene to be imitated. When 
aligning these primers with the target area, they contain 
5 mismatches for the bases that are targeted to be changed. 
Once the A to B and C to D reactions are complete, the 
reaction products are isolated and mixed for use as the 
template for the A to D reaction. This reaction then yields 
the full, mutated product. 
10 Once a gene encoding an entire fusion protein is 

produced it can be cloned into an appropriate expression 
W vector. Specific strategies that can be employed to make 

y the 6-CSF fusion proteins of the present invention are 

^ described in example 1. 

15 

General methods to recombixiantly express the heterologous 
fusion proteins of the present Inv&ition: 
Q Host cells are transf ected or transformed with esqpression 

or cloning vectors described herein for heterologous fusion 
20 protein production and cultured in conventional nutrient media 
Q modified as appropriate for inducing promoters, selecting 

trans formants, or amplifying the genes encoding the desired 
sequences. The culture conditions, such as media, 
temperature, pH and the like, can be selected the skilled 
25 artisan without undue experimentation. In general, 

principles, protocols, and practical techniques for maximizing 
the productivity of cell cultures can be foxmd in Mammalian 
Cell Biotechnology: A Practical Approach, M. Butler, ed. (IKL 
Press, 1991) and Sambrook, efc al., supra. Methods of 
30 transf ection are known to the ordinarily skilled artisan, for 
example, CaPO^ and electroporation. General aspects of 
mammalian cell host system transformations have been described 
in U.S. Patent No. 4,399,216. Transformations into yeast are 
typically carried out according to the method of van Solingen, 
35 at al., J" Bact. 130(2): 946-7 (1977) and Hsiao, et al., Proc. 
Natl. Acad. Sci. USA 76(B) z 3829-33 (1979). Suitable host 
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cells for the esqpression of the fusion proteins of the present 
invention are derived from multicellular organisms. 

The fusion proteins of the present invention may be 
recombinantly produced directly, or as a protein having a 
signal sequence or other additional sequences which create a 
specific cleavage site at the N-terminus of the mature fusion 
protein. In general, the signal sequence may be a component 
of the vector, or it may be a part of the fusion protein- 
encoding DNA that is inserted into the vector. The signal 
sequence may be a prokaryotic signal sequence selected, for 
example, from the group of the alkaline phosphatase, 
penicillinase, Ipp, or heat-stable enterotoxin II leaders. 
For yeast secretion the signal sequence may be, e.g., the 
yeast invertase leader, alpha factor leader (including 
Saccharoniyces and Rluyveromyces cc-f actor leaders, the latter 
described in U.S. Patent No. 5,010,182), or acid phosphatase 
leader, the C. albicans glucoamylase leader (EP 362,179), or 
the signal described in WO 90/13646. In mammalian cell 
expression, mammalian signal sequences may be used to direct 
secretion of the protein, such as signal sequences from 
secreted polypeptides of the same or related species as well 
as viral secretory leaders. 

An example of suitable selectable markers for mammalian 
cells are those that enable the identification of cells 
conpetent to take up the fusion protein- encoding nucleic acid, 
such as DHFR or thymidine kinase. An appropriate host cell 
when wild-type DHFR is employed is the CHO cell line deficient 
in DHFR activity, prepared and propagated as described [Urlaub 
and Chasin, :Proc. Natl. Acad. Sci. USA, 77(7): 4216-20 
(1980)]. A suitable selection gene for use in yeast is the 
trpl gene present in the yeast plasmid yrp7 [Stinchcomb, et 
al./ Nature 282(5734) : 39-43 (1979); Kingsman, et al., Gezxe 
7(2): 141-52 (1979); Tschxiniper, et al.. Gene 10(2): 157-66 
(1980) I . The trpl gene provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in 
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tryptophan, for example, ATCC No. 44076 or PEPCl [Jones, 
Genetics 85: 23-33 (1977)]. 

Various forms of a fusion protein may be recovered from 
culture medium or from host cell lysates. If menibrane-bound, 
it can be released from the membrane using a suitable 
deterg^t solution (e.g., Triton-X 100) or by enzymatic 
cleavage. Cells employed in expression of a fusion protein 
can be disrupted by various physical or chemical means, such 
as freeze- thaw cycling, soni cation, mechanical disruption, or 
cell lysing agents. 

Purification of the heterologous fusion proteins of the 
present inv^tion: 

Once the heterologous fusion proteins of the present 
invention are expressed in the appropriate host cell, the 
analogs can be isolated and purified. The following 
procedures are exemplary of suitable purification procedures: 

Various methods of protein purification may be employed 
and such methods are known in the art and described, for 
exainple, in Deutscher, Methods in Enzymology 182: 83-9 (1990) 
and Scopes, Protein Purification: Principles and Practice, 
Springer-Verlag, NY (1982). The purification step(s) selected 
will depend on the nature of the production process used and 
the particular fusion protein produced. For example, fusion 
proteins coitqprising an Pc fragment can be effectively purified 
using a Protein A or Protein G affinity matix. Low or high pH 
buffers can be used to elute the fusion protein from the 
affinity matrix. Mild elution conditions will aid in 
preventing irreversible denaturation of the fusion protein. 
Imidazole-containing buffers can also be used. Example 3 
describes some successful purification protocols for the 
fusion proteins of the present invention. 

Characterization of the heterologous fusion proteins of the 
present Invention: 



P-15648 



24 

Numerous methods exist to characterize the fusion 
proteins of the present invention • Some of these methods 
include: SDS-PA6E coupled with protein staining methods or 
imrminoblotting using anti-IgG, anti-HA and anti-rG-CSP 
antibodies. Other methods include matrix assisted laser 
desporption/ionizat ion-mass spectrometry (MALDI-MS) , liquid 
chromatography /mass spectrometry, isoelectric focusing, 
analytical anion exchange, chromatofocussing, and circular 
dichroism to name a few. A representative nuxhber of 
heterologous fusion proteins were characterized using SDS-PAGE 
coupled with immunoblotting as well as mass spectrometry 

For example. Table 2 illustrates the calculated molecular 
mass for a representative number of fusion proteins as well as 
the observed mass (as measured by protease mapping/LC-MS} . 
The relative differences between observed mass and mass 
calculated for a nonglycosylated protein are indicative of the 
extent of glycosylation . 



Table 2 



Construct 


Theoretical mass (kDa) 


Observed mass (kDa) 


C17A (GCSF)-IgGl Fc 


89.4 


93.0 


C17A(GCSF)-IgG4Fc 


88.7 


92.7 


C17A(GCSF)-BSA 


86.0 


85.1 


C17A, A37N, S63N, E93N 


89.4 


97-114 (104.7) 


(GCSF)-IgGl Fc 






C17A. A37N. S63N, E93N 


88.7 


96-112 (103.8) 


(GCSF)-IgG4Fc 






C17A, A37N, S63N, E93N 


86.0 


87-98 (92.5) 


(GCSF)-HSA 






C17A,A37N,W58N, 


89.3 


95-110 (103.8) 


Q67N (GCSF)-IgGl Fc 






C17A, A37N. W58N, 


88.8 


95-110 (102.4) 


Q67N(GCSF)-IgG4Fb 






C17A, A37N, W58N, 


85.9 


87-98 (93.1) 


Q67N (GCSF)-HS A 







Administration and Preparation of Compositions: 

The heterologous fusion proteins of the present invention 
may be fosomilated with one or more excipients. The active 
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fusion proteins of the present invention may be coxnbined with 
a pharnaceutically acceptable buffer, and the pH adjusted to 
provide acceptable stability, and a pH acceptable for 
admins tration such as parenteral administration. 
5 Optionally, one or more pharmaceutically-acceptable anti- 

microbial agents may be added. Meta-cresol and phenol are 
preferred phaarmaceutically-acceptable microbial agents. One 
or more pharmaceutically-acceptable salts may be added to 
adjust the ionic strength or tonicity. One or more excipients 
10 may be added to adjust the isotonicity of the formulation. 

Glycerin is an example of an isotonicity-adjusting excipient. 
f Pharmaceutically acceptable means suitable for administration 

y to a h\jman or other animal and thus, does not contain toxic 

^ elements or undersirable contaminants and does not interfere 

^ 15 with the activity of the active conpounds therein. 

A pharmaceutically-acceptable salt form of the 
S9 heterologous fusion proteins of the present invention may be 

g used in the present invention. Acids commonly employed to 

W form acid addition salts are inorganic acids such as 

20 hydrochloric acid, hydrobromic acid, hydriodic acid, sulfuric 
acid, phosphoric acid, and the like, and organic acids such as 
p-toluenesulfonic acid, methanesulfonic acid, oxalic acid, p- 
bromophenyl-sulfonic acid, carbonic acid, succinic acid, 
citric acid, benzoic acid, acetic acid, and the like. 
25 Preferred acid addition salts are those formed with mineral 
acids such as hydrochloric acid and hydrobromic acid. 

Base addition salts include those derived from inorganic 
bases, such as ammonium or alkali or alkaline earth metal 
hydroxides, carbonates, bicarbonates, and the like. Such 
30 bases useful in preparing the salts of this invention thus 
include sodiiam hydroxide, potassixim hydroxide, ammonium 
hydroxide, potassium carbonate, and the like. 



35 



Administration may be via any route known to be effective 
hy the physician of ordinary skill. Peripheral parenteral is 
one such method. Parenteral administration is commonly 
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understood in the medical literature as the injection of a 
dosage form into the body by a sterile syringe or some other 
mechanical device such as an infusion pump. Peripheral 
par^teral routes can include intravenous, intramuscular, 
subcutaneous, and intraperitoneal routes of administration. 

The heterologous fusion proteins of the present invention 
may also be amenable to administration by oral, rectal, nasal, 
or lower respiratory routes, which are non-petrenteral routes* 
Of these noii-parenteral routes, the lower respiratory route 
and the oral route are Dref erred. 

The heterologous fusion proteins of the present 
invention can be used to treat patients with insufficient 
circulating neutrophil levels, typically those xHidergoing 
cancer chemotherapy. 

An *ef fective amounf of the heterologous fusion 
protein is the quantity which results in a desired 
therapeutic and/or prophylactic effect without causing 
unacceptable side-effects when administered to a subject in 
need of G-CSF receptor stimulation. A '^desired therapeutic 
effect includes one or more of the following: 1) an 
amelioration of the symptom(s) associated with the disease 
or condition; 2) a delay in the onset of symptoms associated 
with the disease or condition; 3) increased longevity 
compared with the absence of the treatment; and 4) greater 
quality of life compared with the absence of the treatment. 

The present invention comprises G-CSF coxis>o\inds that 
have improved biochemical and biophysical properties by 
virtue of being fused to an albumin protein, an albumin 
fragment, an albumin analog, a Fc protein, a Fc fragment, or 
a Fc analog. These heterologous proteins can be 
successfully expressed in host cells, retain signaling 
activities associated with activation of the G-CSF receptor, 
and have prolonged half -lives. 
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EXAMPLES 

Example 1; Construction of DNA encoding heterologous fusion 

proteins ; 

Table 1 provides the sequence of primers used to create 
fimctional glycosylation sites in different regions of the 
G^CSF protein (See Figure 1) . 



Table 1: Primer sequences used to introduce imitations into 
h\jiman G-CSF, 



nutation 


A Primer* 


B Primer* 


C Primer* 


D Primer* 


WT 


CF177[SEQ ID 
NO:26] 

CGOkCGCTAaCGG 
CGCGCCaCCATG 

GCCG6ACCTGCC 
ACCCA6AGCCCC 
ATGAAGCT6 


CP178[SEQ ID 
N0:27] 

GGGGCAGGGAGC 
TGGCTGGGCCCA 
GTGGA6T6GCTT 
CCTGCACTGTCC 
AGAGTGCACTGT 
6 


CF179[SEQ ID 
NO:28] 

GGACAGTGCAG6 
AAGCCACTCCAC 
TGGGCCCaGCCA 
GCTCCCTGCCCC 
AGAGCTTCCTQ 


CF176CSEQ ID 
NO:29] 

QAACCTCGAGGA 

TCC?rCATTAGGG 
CT6GGCAAGGTG 
CCTTAAGACGCG 
GTACGACACCTC 
CAGGAA6CTCTG 


C17A 
Sad 


CP177[SEQ ID 
NO:30] 

QTAAGCTTGC6T 
CGACGCTAGCGO 
C6CGCCGCCATG 
GCCQGACCTGCC 
ACCCAGAGCCCC 
AT6AAGCT6 


C17Arev£SEQ 

ID NO: 31] 

GCTCTAAGGCCT 

TGAGCAGGAAGC 

TCTGGGGCAGGG 

AGCTCGCTGGGC 

CCAGTGGAG 


C17Afor[SEQ 

ID NO: 32] 

GGGCCCAGCGAG 

CTCCCTGCCCCA 

GAGCTTCCTGCT 

CAAGGCCTTAGA 

6CAA6 


CP176tSBQ ID 
NO:33] 

GAACCTCa&GGA 

JTCCTCATTAGGG 

CTGGGCAAG6TG' 

CCTTAAGACGCG 

GTACGACACCTC 

CAGGAAGCTCTG 


A37N,Y39T 
Spel 


CP177[S^ ID 
NO: 34] 

GTAAGCTTGCGT 
CGACGCTAGCGG 
CeCGCCGCCkTG 
GCCGGACCTGCC 
ACCCAGAGCCCC 
ATGAAGCTG 


A37Nrev[SEQ 
ID NO: 35] 

GTCCGAGCAGCA 
CTAGTTCCTCGG 
GGTGGCACAGCT 
TGGTGGTGTTAC 
ACAGCTTCTCCT 
G 


A37Nfor[SEQ 

ID NO:36] 

GGCGCAGCGCTC 

CAG6AGAAGCTG 

T6TAACACCACC 

AAGCTGTGCCAC 

CCCGAGGAACTA 

GTQCTQ 


CP176[SEQ ID 
NO:37] 

GaACcnoaaGea 

iTCCTCATTAGGG 
CTGGGCAAG6TG 
CCTTAAGACGCG 
GTACGACACCTC 
CAGGAAGCTCTG 


T133N, 
6135T 


CP177[SBQ ID 
NO:38] 


Tl33Nrev[SEQ 
ID NO:39] 


T133Nfor[SBQ 
ID NO: 40] 


CP176[SEQ ID 
NO: 41] 
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EC047III 


6TA2^6CrTGCGT 
CGACGCTAGCGG 
CGCGCCGCCh'SQ 
6CCG6ACCTGCC 
ACCCA6AGCCCC 
AT6AAGCT6 


GCCCGGCGCTGG 
AAAGCGCTGGCG 
AAGGCCG6CATG 
6CGGTCTG6TT6 
GGCTGCAG6GCA 
6 


GGCCCCTGCCCT 
GCAGCCCAACCA 
GACCGCCATGCC 
GGCCTTCGCCAS 
CGCTTTCCAGCG 


GAACerCGAGGA 
iPCCTCATTAGGG 
CTGGGCAAG6TG 
CCTTAAGACGCG 
GTACGACACCTC 
CAGGAAGCTCTG 


A141N, 

A143T 

Sapl 


CF177[SEQ ID 
NO:42] 

CGACGCTA6CGO 
CGCGCCGCCh'SG 
GCC6GACCTGCC 
ACCCAGAGCCCC 
ATGAAGCTG 


Al4lNrev[SEQ 

ID N0:43] 

GCCCGGCGCTGG 

AAGGTAGA6TT6 

AAGGCCGGCATG 

GCACCCTGGGTG 

GGCTGAAGAGCA 

GGGGCCAT 


Al41N£or[SEQ 

ID N0:44] 

GGQAATGGCCCC 

TGCTCTTCAGCC 

CACCCAGGGTGC 

CATGCCGGCCTT 

CAACTCTACCTT 

CCAGCGCCGGGC 

AG 


CF176CSEQ ID 
NO:45] 

GAACCTCGAGGA. 

TCCTChlTAGGG 
CTGGGCAAGGTG 
CCTTAAGACGCG 
GTACGACACCTC 
CAGGAAGCTCTG 




P57V, 

W58N,P60T 
Bpal 


JCB128[SEQ 
ID NO:46] 
GCTAGCGGCGCG 
CCACCATG 


JCB136CSEQ 
ID NO: 47] 
GCTCAGGGTAGC 
6TTAACGATGCC 
CAGA6AGTG 


JCB137 [SEQ 
ID NO: 48] 
GGGCATC6TTAA 
CGCTACCCTGAG 
CAGCTG 


JCB129 [SEQ 
ID NO: 49] 
6ACTCGAGGATC 
CTCATTAGGGCT 
GGG 


Q67N,L69T 
Nael 


JCB134[SEQ 

ID NO: 50] 

GCTAGCGGCGCG 

CCACCATGGCCG 

62WCCa<GCCACCC 

AG 


JCB138i:SEQ 

ID NO: 51] 

CAAGCAGCCGGC 

CAGCTGGGTC^ 

GTTGCTGGGGCA 

GCTGCTCAG 


JCB139 [SEQ 

ID NO:52] 

GCCCCAGCAACG 

CCACCCAGCTGG 

CCGGCTGCTTGA 

G 


JCB135 [SEQ 

ID NO: 53] 

fiaCTCGAGGATC 

CTCATTAGGGCT 

GG6CAAGGT6CC 

TTAAGACGCGG 


P60N,S62T 
Spel 


JCB128[SEQ 
ID NO: 54] 
GCTAGCGGCGCG 
CCACCATG 


JCB130CSEQ 
ID NO: 55] 
GGGGCAACTA6T 
CAGGTTAGCCCA 
6GG 


JCB131CSBQ 
ID NO: 56] 
GCTAACCTGACT 
AGTTGCCCCAGC 
CA6 


JCB129[SEQ 
ID NO: 57] 
GaCTCGAGGATC 
CTCATTAGGGCT 
GGG 


S63N,P65T 
Mfel 


JCB128 [SBQ 
ID NO: 58] 
GCTAGCGGCGCG 
CCACCATG 


JCB132[SEQ 
ID NO: 59] 
G6TGCAATTGCT 
CAGGGGAGCCCA 


JCB133[SEQ 
ID NO: 60] 
GCA&VT6CACCA 
6CCAGGCCCT6 


JCB129[SEQ 
ID NO: 61] 
GACTCGAGGATC 
CTCATTAGGGCT " 
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G 




G6G 


B93N, I95T 
BspBI 


JCB134[SEQ 

ID NO: 62] 

GCTA6CGGCGCG 

CC&CCATGGCCX3 

GACCTGCCACCC 

AG 


JCB140 [SBQ 

ID 110:63] 

CCG6ACTGGTCC 

CGTTCAGGGCCT 

6CAGGAGCCCCT 

G 


JCB141 [SEQ 
ID NO: 64] 
GAACGGGACCAO 
SCCGGA6TTGGG 
TCCCACCTTGG 


JCB135 I SEQ 

ID NO: 65] 

CyurrCGAGGATC 

CTCATTA6GGCT 

6GGCAAQ6TGCC 

TTAAGACGCGG 



Sail 


JCB155ISEQ 
ID NO: 66] 
GTCQACGCTAGC 
GGCGCGCCACCA 
TGGCCGGACCTG 









*Nucleoticles in bold represent changes iinposed in the target 
sequence and nucleotides in bold and italics represent 
flanking sequences which may add restriction sites to 
facilitiate cloning, Kozac sequences, or stop codons. 



Preparation la: DNA encoding wild- type human G-CSF 

A strand overlapping extension PGR reaction was used to 
create a wild type human G-CSF construct in order to 
eliminate the methylation of an Apal site. Isolated hioman 
6-CSF cDNA served as the tentplate for these reactions. The 
5.' end A primer was used to create a restriction enzyme site 
prior to the start of the coding region as well as to 
introduce a Kozac seQ[uence (G6C6CC) 5' of the coding leader 
sequence to faciliate translation in cell culture. 

The A-B product was generated using primers CF177 and 
CF178 in a PGR reaction. Likewise, the G-D product was 
produced with primers CF179 and CF176. The products were 
isolated and combined. The coiribined 3naixture was then used 
as a tentplate with primers CF177 and GF178 to create the 
full-length wild- type construct. [Nelson, R.M. and Long, 
G.C. (1989), Anal. Biochem. 180:147-151]. 

The full-length product was ligated into the pGR2.1- 
Topo vector (Invitrogen, Inc. Cat. No. K4500-40) by way of a 
topoisomerase TA overhang system to create pCR2.1G-CSF. 
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The following protocol was used for preparation of the 
full-length wild-type G-CSF protein as well as each of the 
G-CSF analogs. Approximately 5 ng of template DNA and 15 
pmol of each primer was used in the initial PGR reactions. 
5 The reactions were prepared using Platinum PGR Supermix® 
(GlbcoBRL Cat. No. 11305-016) • The PGR reactions were 
denatured at 94^G for 5 min and then subject to 25 cycles 
wherein each cycle consisted of 30 seconds at 94^G followed 
by 30 seconds at 60*^G followed by 30 seconds at 72^G. A 
10 final extension was carried out for 7 minutes at 72*'g. PGR 
fragments were isolated from agarose gels and purified using 
^ a Qiaquick® gel extraction kit (Qiagen, Gat- No. #28706). 

y DNA was resuspended in sterile water and used for the final 

^ PGR reaction to prepare full-length product. 

Preparation lb: DNA encoding G-GSP[A37NrY39T, 
P57V,W58N,P60T,Q67N,L69T] was constructed as follows: 

DNA encoding G-GSP[A37N, Y39T^Q67N, L69T] was subcloned 
into PJB02 to create pJB02G-GSF [A37N/ Y39T,Q67N, L69T] and 
y 20 pJB02G-CSF[A37N,Y39T, P57V,W58N,P60T] served as the template 
^ for strand overlapping escpression PGR. JCB155 and JGB136 

^ served as the A and B primers and JGB137 and JGB135 served 

as the G and D primers. The full-length mutated cDNA was 
prepared as described previously using JGB155 and JGB134 
25 ' primers. The resulting full-length DNA encodes a protein 
with consensus N-linked glycosylation sites in region 1, 
region 2, and region 9 of the protein (See Figure 1). The 
full-length cDNA was ligated back into pGR2.1-Topo to create 
pCR2 . IG-CSF [A37N, Y39T, P57V, W58N, P60T, 
30 Q67N,L69T] . 



m 
y 



Preparation Ic: DNA encoding G-GSF [A37N, Y39T, 
S63N,P64T,E93N/I95T] was constanicted as follows: 

DNA encoding G-GSP tA37N, Y39T,B93N, I95T] was subcloned 
35 into PJB02 to create pJB02G-GSF[A37N,Y39T,E93N,I95T] and 

pi7B02G-CSF[A37N,Y39T,E93N,X95T] served as the tengplate for 



10 
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strand overlapping e^qpression PCR. JCB155 and JCB132 served 
as the A and B primers and JCB133 and JCB135 served as the C 
and D primers. The full-length laatated cDNA was prepared as 
described previously using JCB155 and JCB135 primers. The 
resulting full-length DNA encodes a protein with consensus 
N-linked glycosylation sites in region 1, region 7, and 
region 10 of the protein (See Figure 1) . The full-length 
CDNA was ligated back into pCR2 • 1-Topo to create pCR2 • IQ- 
CSP[A37N,Y39T,S63N,P64T,E93N,I95T1 . 

Preparation Id: DNA encoding G-CSF[C17A] which is G-CSF 
® wherein the amino acid at position 17 is substituted with 

yj Ala is constructed as follows: 

Of The wild- type construct in the pCR2.1-Topo vector 

^ 15 {pCR2.1G-CSP) serves as the PCR tenqplate for the C17A 
^ . mutatgenesis . Strand ovelapping extension PCR is performed 
^ as described previously. CP177 and C17Arev serve as the A-B 

63 primers and C17Afor and CF176 serve as the C-D primers. The 

y full-length mutated cDNA is prepared as described previously 

^ 20 using the CF177 and CF176 primers. The B and C primers are 
h used to mutate the DNA such that a Saci restriction site is 

created and the protein expressed from the full-length 
sequence contains an Alanine instead of a Cysteine at 
position 17. The full-length cDNA is ligated back into the 
25 pCR2.1-Topo vector to create pCR2 . IG-CSP [C17A] wherein the 
sequence is confirmed. G-CSP analog encoding DNA is then 
cloned into the J»he/Xho sites of mammalian expression vector 
pJB02 to create pJB02G-CSF[C17A] . 

30 Preparation le: DNA encoding G-CSP tA37N,Y39T] is 

constructed as follows: 

Strand overlapping extension PCR is performed using 
pCR2 • IG-CSF [C17A1 as the template. Primers CF177 and 
A37Nrev serve as the A-B primers and CF176 and A37Nfor serve 
35 as the C-D primers. The full-length mutated cDNA is prepared 
as described previously using the CP177 and CF176 primers. 



ru 
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The B and C primers contain mismatched sequences such that a 
Spel site is created in the DNA and the protein expressed 
from the full-length sequence contains a consensus sequence 
for N-linked glycosylation in region 1 of the protein. The 
full-length cDNA is ligated back into the pCR2.1-Topo vector 
to create pCR2.1G-CSF[A37N,Y39Tl wherein the sequence is 
confirmed. G-CSP analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian e3cpression vector pJB02 to create 
pJB026-CSF[A37N,Y39T] . 

Preparation If: DNA encoding G-CSF[P57V,W58N, P60T] is 
constructed as follows: 

Strand overlapping extension PGR is performed using 
PJB02G-CSP (C17A] as the template. Primers JCB128 and JCB136 
serve as the A-B primers and JCB137 and JCB129 serve as the 
C-D primers- The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that a 
Hpal site is created and the protein es^ressed from the 
full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 2 of the protein. The full- 
length CDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2.1G-CSF[P57V,W58N,P60T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 
PJB02G-CSF [P57V, W58N, P60T] . 

Preparation Ig: DNA encoding G-CSF [P60N,S62T] is 
constructed as follows: 

Strand overlapping extension PGR is performed using 
pJB02G-CSF[C17A] as the template. Primers JCB128 and JCB130 
serve as the A-B primers and JCB131 and JCB129 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contadLn mismatched sequences such that a 
Sjpel site is created and the protein expressed from the 
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full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 4 of the protein. The full- 
length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2.1G-CSF[P60N,S62T] wherein the sequence is 
5 confirmed. G-CSP analog encoding UNA is thai cloned into the 
Nhe/Xho sites of mammalian expression vector pJB02 to create 
pJB02G-CSPtP60N,S62T] . 

Pr^aration Ih: DNA encoding G-CSP [S63N,P65T] is 
10 constructed as follows: 

Strand overlapping extension PGR is performed using 
2 PJB02G-CSF [C17A1 as the tennplate. Primers JCB128 and JCB132 
W serve as the A-B primers and JCB133 and JCB129 serve as the 

^ C-D primers. The full-length mutated ciMlA is pr^ared as 
53 15 described previously using the JCB128 and JCB129 primers. 
^ The B and C primers contain mismatched sequences such that a 
Mfel site is created and the protein e:q?ressed from the 
full-length sequence contains a consensus sequence for N- 
^ linked glycosylation in region 7 of the protein. The full- 

length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . IG-CSF [S63N, P65T] wherein the sequence is 
confirmed. G-CSF analog encoding DNA is then cloned into the 
Nhe/Xho sites of mammalian es^ression vector pJB02 to create 
PJB02G-CSP [S63N« P65T] . 

25 

Preparation li: DNA encoding G-CSP [Q67N, L69T] is 
constructed as follows: 

Strand overlapping extension PGR is performed using 
PJB02G-CSF (C17A] as the template. Primers JCB134 and JCB138 
30 serve as the A-B primers and JCB139 and JCB135 serve as the 
C-D primers. The full-length mutated cDNA is prepared as 
described previously using the JCB128 and JCB129 primers. 
The B and C primers contain mismatched sequences such that a 
Nael site is created and the protein expressed from the 
35 full-length sequence contains a consensus sequence for N- 
linked glycosylation in region 9 of the protein. The full- 
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length cDNA is ligated back into the pCR2.1-Topo vector to 
create pCR2 . 16-CSF tQ67N, L69T] wherein the sequence is 
confirmed. G-CSP analog encoding DNA is then cloned into the 
NhB/Xho sites of mammalian expression vector pJB02 to create 
5 PJB02G-CSP [Q67N, L69T] . 

Preparation 1 j : DNA encoding G-CSP [E93N, 195T] is 
constructed as follows: 

Strand overlapping extension PGR is performed using 
10 pJB02G-CSF[C17A]. as the teitiplate. Primers JCB134 and JCB140 
serve as the A-B primers and JCB141 and JCB135 serve as the 
^ C-D primers. The full-length mutated cDNA is prepared as 
y described previously using the iJCB128 and JCB129 primers. 
8^ The B and C primers contain mismatched sequences such that 
^15 a BspEI site is created and the protein expressed from the 
?f full-length sequence contains a consensus sequence for N- 

linked glycosylation in region 10 of the protein. The full- 
er length cDNA is ligated back into the pCR2.1-Topo vector to 
W create pCR2.1G-CSF[E93N, I95T] wherein the sequence is 

20 ' confirmed. G-CSF analog encoding DNA is then cloned into the 
S Nhe/Xtio sites of mammalian expression vector pJB02 to create 

^ PJB02G-CSF [E93T, I95T] . 

Preparation Ik: DNA encoding G-CSF [T133N,G135T1 is 

25 constructed as follows: 

Strand overlapping extension PGR is performed using 
pCR2 • IG-CSP [C17A] as the template. Primers CF177 and 
Tl33Nrev serve as the A-B primers and T133Nfor and CF176 
serve as the C-D primers. The full-length mutated cDNA is 

30 prepared as described previously using the CF177 and CF176 
primers. The B and C primers contain mismatched sequences 
such that an Eco47XII site is created and the protein 
expressed from the full-length sequence contains a consensus 
sequence for N-linked glycosylation in region 13 of the 

35 protein. The full-length cDNA is ligated back into the 
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pCR2*l-Topo vector to create pCR2.1G-CSFlTl33N,G135T] 
wherein the sequence is confirmed. 

Preparation 11: DMA encoding G-CSF[A141N, A143T] is 
5 constructed as follows: 

Strand overlapping extension PGR is performed using 
pCR2 . IG-CSF [C17A1 as the template. Primers CF177 and 
Al41Nrev serve as the A-B primers and Al41Nfor and CF176 
serve as ^ the C~D primers. The full-length imitated cDNA is 
10 prepared as described previously using the CF177 and CF176 
primers. The B and C primers contain mismatched sequences 
such that an Sapl site is created and the protein expressed 
y from the full-length sequence contains a consensus sequence 
^ for N-linked glycosylation in region 14 of the protein. The 

^ is full-length cDNA is ligated back into the pCR2.1-Topo vector 
HP to create pCR2 . 1G-CSF[A141N, A143T] wherein the sequence is 

^ conf irmed . 

W Preparation Im: DNA encoding G-CSF [A37N, Y39T,T133N, 

yi 20 G135T] is constructed as follows: 

g3 A 210 bp insert containing G-CSF [A37N, Y39T] is isolated 

^ from pCR2.1G-CSF[A37N,Y39T] using EcoNI. This fragment is 

ligated into pCR2 .1G-CSF[T133N,6135T] which is prepared by 
cleavage with EcdNI and subsequent isolation of the vector 
25 (4359 bp) from a 210 l^p fragment containing wild-type G-CSF 
sequences. This ligation creates pCR2.16- 
CSF[A37N, Y39T,T133N,G135T] . Analog encoding DNA is then 
subcloned into pJB02 using Nhel/XhoX to create pJB02G- 
CSF[A37N,Y39T,T133N,G135T1 . 

30 

Preparation In: DNA encoding G-CSF [ A3 7N,Y39T^A141N, 
A143T] is constructed as follows: 

A 210 bp insert containing G-CSF [ A3 7N,Y39T1 is isolated 
from pCR2.1G-CSF[A37N,Y39T] using l^coWJ. This fragment is 
35 ligated into pCR2 .1G-CSF[A141N,A143T] which is prepared by 
cleavage with EcoNX and subsequent isolation of the vector 
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(4359 bp) from a 210 bp fragment containing wild-type G-CSF 
sequences. This ligation creates pCR2.1G- 
CSF[A37N,Y39T,A141N,A143T] . Analog encoding DNA is then 
stabcloned into pJB02 using Nhel/Xhol to create pJB02G- 
5 CSF[A37N,Y39T,A141N,A143T] . 

Preparation lo: UNA encoding G-CSP[A37N,Y39T, P57V,W58N, 
P60T] is constructed as follows: 

DNA encoding 6-CSP[A37N,Y39T] is stjbcloned into pJB02 
10 to create pJB02G-CSF[A37N, Y39T] and pJB02G-CSF[A37W,Y39T] 
serves as the teir¥)late for strand overlapping expression 
^ PGR. JCB128 and JCB136 serve as the A and B primers and 

y JCB137 and JCB129 searve as the C and D primers. The full- 

^ length mutated cDNA is prepared as described previously 

^ 15 using JCB128 and JCB129 primers. The resulting full-length 
<i DNA encodes a protein with consensus N-linked glycosylation 
^ sites in r.egion 1 and region 2 of the protein. The full- 
O length cDNA is ligated back into pCR2 . 1-Topo to create 

^ pCR2 . IG-CSF [A37N, Y39T, P57V, W58N, P60T] . 

in 20 

© Preparation Ip: DNA encoding G-CSF [A37N,Y39T,Q67N,L69T] 

^ is constructed as follows: 

DNA encoding G-CSF [A37N,Y39T] is subcloned into pJB02 
to create pJB02G-CSF[A37N, Y39T] and pJB02G-CSF[A37N,Y39T] 
25 serves as the ten©late for strand overlaRping expression 
PCR. JCB134 and JCB138 serve as the A and B primers and 
JCB139 and JCB135 serve as the C and D primers. The full- 
length mutated cDNA is prepared as described previously 
using JCB128 and JCB129 primers. The resulting full-length 
30 DNA encodes a protein with consensus N-linked glycosylation 
sites in region 1 and region 9 of the protein. The full- 
length cDNA is ligated back into pCR2.1-Topo to create 
pCR2 . IG-CSF [A37N, Y39T, Q67N, L69T1 . 



35 



Preparation Iq: DNA encoding G-CSF [A37N, Y39T, E93N, 195T1 



is constructed as follows: 
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DNA encoding G-CSF [A37N, Y39T] is subcloned into pJB02 to 
create pJB02G-CSF[A37N, Y39T] and pJB02G-CSF [A37N, Y39T] 
serves as the template for strand overlapping expression 
PGR. JCB134 and JCB140 serve as the A and B primers and 
JCB141 and JCB135 serve as the C and D primers. The full- 
length mutated cDMA is prepared as described previously 
using JCB128 and JCB129 primers. Hie resulting full-length 
DNA encodes a protein with consensus N- linked glycosylation 
sites in region 1 and region 10 of the protein- The full- 
length cDNA is ligated back into pCR2.1-Topo to create 
pCR2 .lG-CSFtA37N, Y39T,E93N, I95T] . 

Various mutated G-CSF polypeptides were then ligated 
in- frame directly to DNA encoding IgGl or IgG4. Various 
mutated G-CSF polypeptides were also ligated in- frame via 
DNA encoding a peptide linker or directly to hximan albuniin. 

Exaiiyg>le 2; Expression of heterologus fusion proteins: 

2a: Expression in 293/EBNA cells: 

Each full-length DNA encoding a heterologous fusion 
protein was cloned into the pJB02, This vector contains 
both the Ori P and Epstein Barr virus nuclear antigen (EBNA) 
conponents which are necessary for sustained, transient 
expression in 293 EBNA cells. Further, this expression 
plasmid contains a puromycin resistance gene expressed from 
the CMV promoter as well as an ampicillin resistance gene, 
rfhe gene of interest is also expressed from the CMV 
promoter . 

The transfection mixture was prepared by mixing 73 Jil 
of the liposome transfection agent Fugene 6® {Roche 
Molecular Biochemicals, Cat. No. 1815-075) with 820 jil Qpti- 
Mem® (GibcoBRL Cat. No. 31985-062). G-CSF pJB02 DNA 
(12|xg), prepared using a Qiagen plasmid maxiprep kit 
(Qiagen, Cat. No. 12163), was then added to the mixture. 
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The mixture was incubated at room teits)erature for 15 
minutes . 

Cells were plated on 10 cm^ plates in DMEM/F12 3:1 
(GibcoBRL Cat. No, 93-0152DK) supplemented with 5% fetal 
5 bovine serum, 20nM HEPES, 2 nM L-glutamine, and 50 jig/mL 
Geneticin such that the plates were 60% to 80% confluent by 
the time of the transfection. Immediately before the 
transfection mixture was added to the plates, fresh media 
was added. The mixture was then added dropwise to cells 
10 with intermittent swirling. Plates were then incubated at 
37°C in a 5% CO2 atmosphere for 24 hours at which point the 
media was changed to hybritech medixim without serum. The 
media containing a secreted form of a glycosylated G-CSP 
in analog was then isolated 48 hours later. 



15 



Bp 2b: Expression in CHO cells: 

^ The es^ression vector for e3!pression in CHO-Kl cells is 

^ PEE14.1. This vector includes the glutamine synthetase gene 

^ which enables selection using methionine sulfoximine. This 

^ 20 gene includes two poly A signals at the 3' end. G-CSF 
d . analogs are expressed from the CMV promoter which includes 
^ 5' ointranslated sequences from the hCMV-MIE gene to enhance 

mRNA levels and trans la tability. The SV40 poly A signal is 
cloned 3' of the fusion protein DNA. The SV40 late promoter 
25 drives es^ression of GS minigene. This expression vector 
encoding the gene of interest is prepared for transfection 
using a QIAGEN Maxi Prep Kit (QIAGBN, Cat. No. 12362) . The 
final DNA pellet (50-100 ng) is resuspended in 100 Jil of 
basal formulation medium (GibcoBRL CD-CHO Medium without L- 
30 Glutamine, without thymidine, without hypoxan thine) . 

Before each transfection, CHO-Kl cells are counted and 
checked for viability. A volume equal to 1 x lO' cells is 
centrifuged and the cell pellet rinsed with basal 
formulation medium. The cells are centrifuged a second time 
35 and the final pellet resuspeaded in basal formulation medium 
(700 |Al final volume) . 
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The resuspended DNA and cells are then mixed together 
in a standard electroporation cuvette (Gene Pulsar Cuvette) 
used to support mammalian transfections, and placed on ice 
for five minutes. ThB cell/DNA mix is then el ectr operated 
in a BioRad Gene Pulsar device set at 300V/975 ^^e and the 
cuvette placed back on ice for five minutes. The cell/DNA 
mixed is diluted into 20 itil of cell growth medium in a non- 
tissue culture treated T75 flask and incubated at 37*^0 / 5% 
CO2 for 48-72 hours. 

The cells are counted, checked for viability, and 
plated at various cell densities in selective medium in 96 
well tissue culture plates and incubated at 37^C in a 5% CO2 
y atmosphere. Selective medium is basal medium with IX HT 

ffi Supplement (GibcoBRL lOOX HT Stock), 100 yia/JxiL Dextran 

15 Sulfate (Sigma 100 mg/ml stock), IX GS Supplements (JRH 
^iP Biosciences SOX Stock) and 25 MM MSX (Methionine 

^ Sulphoximine) . The plates are monitored for colony 

@ formation and screened for glycosylated 6-CSF analog 

y production. 
111 

h Exampl e 3; Purification of Heterologous Fusi on Proteins 

yy - 

Heterologous fusion proteins containing a 
hyperglycosylated 6-CSF protein fused to albaimin were 

25 harvested from host cells and then dialyzed against 20 mM 
Tris pH 7.4. An anion exchange coloomn (1 ml Pharmacia 
HiTrap Q) was equilibrated with 20 iriM Tris pH 7.4 and the 
dialyzed material loaded at 2 ml/min. The protein was 
eluted from the colxamn using a linear gradient from 0 to 500 

30 xm NaCl in 80 min at 1 ml/min and elution was monitored by 
UV absorbance at 280 mm. SDS-PAGE analysis was used to 
identify and pool fractions of interest. This pool was 
dialyzed against 25 iriM sodium acetate (NaOAc) e>H 5.0 

A cation exchange column (1 ml Pharmacia HiTrap S 

35 column) was equilibrated with 25 icM NaOAc pH 5.0 and the 
dialysate was loaded at 1 ml/min. The protein was eluted 
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from the column using a linear gradient from 0 to 500 nM 
NaCl in 30 min. The fractions were immediately neutralized 
with 1 M Tris pH 8 to a final pH of 7. SDS-PAGE gels were 
used to identify and pool fractions of interest. 

Heterologous fusion proteins containing a 
hyperglycosylated G-CSF protein fused to an Pc protein were 
harvested from host cells and dialyzed against 20 iriM sodi\am 
phosphate pH 7.O. An affinity coltimn (1 ml Pharmacia HiTrap 
Protein A or rProtein A) was equilibrated with 20 mM sodium 
phosphate pH 7.0 and the dialysate was loaded at 2 ml/min. 
1 ml/min of 100 iriM citric acid pH 3 was used to elute the 
protein. Fractions were immediately neutralized with IM 
Tris E« 8 to pH 7 and peak fractions (determined by in-line 
OD280 monitoring) were further diluted with 20 anM sodium 
phosphate pH 7.0. . SDS-PAGE analysis was used to identify 
and pool fractions of interest. 
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WE CLAIM ; 

1. A heterologous fusion protein comprising a 
hyperglycosylated G-CSF analog fused to a polypeptide 
selected from the group consisting of 

a) human albumin; 

b) human albumin analogs; and 

c) fragments of hiunan albtimin. 



2, The heterologous fusion protein of claim 1, wherein the 
hyperglycosylated G-CSF analog is fused to the polypeptide 
5 via a peptide linker. 

15 3 . The heterologous fusion protein of Claim 2 wherein the 
peptide linker is selected from the group consisting of: 

iifi 

a) a glycine rich peptide; 

b) a peptide having the sequence [Gly-Gly-Gly-Gly-SerJn 
where n is 1, 2, 3, 4, or 5; and 

y 20 c) a peptide having the sequence [Gly-Gly-Gly-Gly- 

H Ser]3. 

4. The heterologous fusion protein of Claims 1, 2, or 3 
wherein the hyperglycosylated G-CSF analog comprises the 
25 amino acid sequence of the formula I: [SBQ ID NO: 1] 
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Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His I.eu Aln Ser Phe 

165 170 
Leu Glu val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro (I) 

5 wherein: 

Xaa at position 17 is Cys, AIsl, Leu, Ser, or Glu; 
Xaa at position 37 is Ala or Asn; 

Xaa at position 38 is Thr, or any other amino acid except 
Pro; 

10 Xaa at position 39 is Tyr, Thr, or Ser; 
Xaa at position 57 is Pro or Val; 
Xaa at position 58 is Trp or Asn; 

Xaa at position 59 is Ala or any other amino acid except 
Pro; 

15 Xaa at position 60 is Pro, Thr, Asn, or Ser, 

Xaa at position 61 is Leu, or any other amino acid except 

apa Pro * 

Xaa' at position 62 is Ser or Thr; 

ffl xaa at position 63 is Ser or Asn; 

M 20 Xaa at position 64 is Cys or any other amino acid except 
^ Pro; 

4 xaa at position 65 is Pro, Ser, or Hir; 
W xaa at position 66 is Ser or Thr; 

* Xaa at position 67 is Gin or Asn; 

^ 25 Xaa at position 68 is Ala or any other amino acid except 

iiJ Pro ; 

5 Xaa at position 69 is Leu, Thr, or Ser 
S Xaa at position 93 is Glu or Asn 

xaa at position 94 is Gly or any other amino acid except 
30 Pro; 

xaa at position 95 is lie, Asn, Ser, or Thr; 
Xaa at position 97 is Pro, Ser, Thr, or Asn; 
Xaa at position 133 is Thr or Asn; 

Xaa at position 134 is Gin or any other amino acid except 
35 Pro; 

Xaa at position 135 is Gly, Ser, or Thr 
Xaa at position 141 is Ala or Asn; 

Xaa at position 142 is Ser or any other amino acid except 
Pro ; and 

40 Xaa at position 143 is Ala, Ser, or Thr; 
and wherein: 

Xaa at positions 37, 38, and 39 constitute region 1; 
Xaa at positions 58, 59, and 60 constitute region 2; 
45 Xaa at positions 59, 60, and 61 constitute region 3; 

Xaa at positions 60, 61, and 62 constitute region 4; 
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Xaa at positions 61, 62, and 63 constitute region 5; 
Xaa at positions 62, 63, and 64 constitute region 6; 
Xaa at positions 63, 64, and 65 constitute region 7; 
Xaa at positions 64, 65, and 66 constitute region 8; 
5 Xaa at positions 67, 68, and 69 constitute region 9; 

Xaa at positions 93, 94, and 95 constitute region 10; 
Xaa at positions 94, 95, and Ser at position 96 

constitute region 11; 
Xaa at positions 95, and 97, and Ser at position 96 
10 constitute region 12; 

Xaa at positions 133, 134, and 135 constitute 
^ region 13; 

Xaa at positions 141, 142, and 143 constitute 
region 14; 

15 

and provided that at least one of regions 1 through 14 
comprises the sequence Asn Xaal Xaa2 wherein Xaal is any 
amino acid except Pro and Xaa2 is Ser or Thr. 

W " 

20 5. The heterologous fusion protein of Claim 4 wherein any 
P two regions of regions 1 through 14 coinprise the sequence 

Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 

25 6. The heterologous fusion protein of Claim 4 wherein any 
three regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr. 

30 7. The heterologous fusion protein of Claim 4 wherein any 
four regions of regions 1 through 14 congprise the secjuence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 
Xaa2 is Ser or Thr 
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8. The heterologous fusion protein of Claim 4 wherein the 
hyperglycosylated G-CSF analog is selected from the group 
consisting of: 

a) G-CSP[A37N,Y39T] 

5 b) G-CSP(P57V,W58N,P60T] 

C) Q-CSP[P60N,S62TJ 

d) 6-CSF[S63N,P65Tl 

e) G-CSF [Q67N,L69T] 

f) G-CSF [E93N, I95T] 
10 g) 6-CSF[Tl33N,G135T] 

h> G-CSF [A141N,A143T] 

^ i) G-CSF [A37N,Y39T,P57V,W58N,P60T] 

y j) G-CSF [A37N,Y39T,P60N,S62T] 

^ k) G-CSP[A37N,y39T,S63N,P65Tl 

yp 15 1) 6-CSF[A37N,y39T,Q67N,L69T] 

■f m) G-CSP[A37N,y39T,E93N,I95T] 

f n) G-CSP[A37N,Y39T,T133N,G135T] 

jS o) G-CSF (A37N,Y39T,A141N,A143T] 

p) G-CSF [A37N,Y39T,P57V,W58N,P60T,S63N,P65T] 

yi 20 q) G-CSF [A37N,Y39T,P57V,W58N,P60T,Q67N,L69T] 

§ r) G-CSF [A37N,Y39T,S63N,P65T,E93N,I95T3 

9. The heterologous fusion protein of claim 8, wherein the 
hyperglycosylated G-CSF analog is G-CSP[A37N, 

25 Y39T, P57V, W58N, P60T, Q67N, L69T] . 



10. ^e heterologous fusion protein of claim 8, wherein the 
hyperglycosylated G-CSF analog is G- 

CSP [A37N, Y39T, S63N, P65T, E93N, I95T] . 

11. A heterologous fusion protein which is the product of 
the escpression in a host cell of an exogenous DNA sequence 
which coniprises a DNA sequence encoding a heterologous 
fusion protein of any one of Claims 1 through 10. 
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12. A polynucleotide encoding a heterologous fusion protein 
of any one of Claims 1 through 11. 

13. A polynucleotide which contprises a DNA sequence 

5 selected from the group consisting of: 

a) SEQ ID NO: 2 

b) SEQ ID N0:3 

c) SEQ ID lSrO:4 

d) SEQ ID N0:5 
10 e) SEQ ID NO: 6 

f) SEQ ID NO: 7 

g) SEQ ID N0:8 

h) SEQ ID N0:9 

i) SEQ ID NO:10 
gg 15 j) SEQ ID N0:11 
h 3c) SEQ ID N0:12 
gP 1) SEQ ID NO: 13 
^ ml SEQ ID NO: 14 

n) SEQ ID NO: 15 

k 20 o) SEQ ID NO: 16, 

^ fused in- frame to a DNA encoding a protein selected from the 

^ group consisting of: 
U a) human albumin, 

ifi b) human albumin analog; and 

25 c) fragments of human albumin. 



ry 



14. The polynucleotide of Claim 13, wherein the DNA fused 
in- frame comprises SEQ ID NO: 17. 

30 15 • The heterologous fusion protein of any one of Claims 1 
through 11 wherein the polypeptide is human albiomin. 

16. The heterologous fusion protein of any one of Claims 1 
through 11 wherein the polypeptide is an N-terminal fragment 

35 of albumin. 

17. A method for increasing neutrophil levels in a mammal 
coznprising administering a therapeutically effective amoiint 
of the heterologous fusion protein of any one of Claims 1 

40 through 11, 15 and 16. 
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18. The use of the heterologous fusion protein as claimed 
in any one of Claims 1 through 11, 15 and 16 for the 
manufacture of a medicament for the treatment of patients 
with insufficient circulating neutrophil levels. 

19. Use of a heterologous fusion protein of any one of 
Claims 1 through 11, 15, and 16 as a medicament. 

20. Use of a heterologous fusion protein of any one of 
Claims 1 through 11, 15, and 16 for the treatment of 
patients with insufficient circulating neutrophil levels. 

21. A pharmaceutical forxrailation adapted for the treatment 
of patients with insufficient neutrophil levels comprising a 
heterologous fusion protein of any one of Claims 1 through 
11, 15, and 16. 

22. A heterologous fusion protein comprising a 
hyperglycosylated G-CSF analog fused to a polypeptide 
selected from the group consisting of: 

a) the Fc portion of an immunoglobulin; 

b) an analog of the Fc portion of an immunoglobulin; 
and 

c) fragments of the Fc portion of an immunoglobulin. 

23. The heterologous fusion protein of Claim 22, wherein 
the hyperglycosylated G-CSF analog is fused to the 
polypeptide via a peptide linker. 

24. The heterologous fusion protein of the Claim 23 wherein 
the peptide linker is selected from the group consisting of: 

a) a glycine rich peptide; 

b) a peptide having the sequence [Gly-Gly-Gly-Qly- Serin 
where n is 1, 2, 3, 4, or 5; and 

c) a peptide having the sequence [Gly-Gly-Gly-Gly-SerJa. 
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25. The heterologous fusion protein of Claims 22, 23 or 24, 
wherein the hyperglycosylated 6-CSP analog cornprises the 
amino acid sequence of the formula I: [SEQ ID NO: 1] 
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Xaa at position 93 is 6lu or Asn 

Xaa at position 94 is Gly or any other amino acid except 
Pro ; 

Xaa at position 95 is lie, Asn, Ser, or Tbx; 
5 Xaa at position 97 is Pro, Ser, Thr, or Asn; 
Xaa at position 133 is Thr or Asn; 

Xaa at position 134' is Gin or any other amino acid except 
Pro; 

Xaa at position 135 is Gly, Ser, or Thr 
10 Xaa at position 141 is Ala or Asn; 

Xaa at position 142 is Ser or any other amino acid except 
Pro; and 

Xaa at position 143 is Ala, Ser, or Thr; 

15 and wherein; 

Xaa at positions 37, 38, and 39 constitute region 1; 

Xaa at positions 58, 59, and 60 constitute region 2; 

Xaa at positions 59, 60, and 61 constitute region 3; 

Xaa at positions 60, 61, and 62 constitute region 4; 

^ 20 Xaa at positions 61, 62, and 63 constitute region 5; 

P Xaa at positions 62, 63, and 64 constitute region 6; 

y Xaa at positions 63, 64, and 65 constitute region 7; 



a 



Xaa at positions 64, 65, and 66 constitute region 8; 
Xaa at positions 67, 68, and 69 constitute region 9; 
ftf 25 Xaa at positions 93, 94, and 95 constitute region 10; 

Xaa at positions 94, 95, and Ser at position 96 

constitute region 11; 
Xaa at positions 95, and 97, and Ser at position 96 
constitute region 12; 
30 Xaa at positions 133, 134, and 135 constitute 

region 13; 

Xaa at positions 141, 142, and 143 constitute 
region 14; 



35 



and provided that at least one of regions 1 through 14 
comprises the sequence Asn Xaal Xaa2 wherein Xaal is any 
amino acid except Pro and Xaa2 is Ser or Thr. 
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26. The heterologous fusion protein of Claim 25 wherein any 
two regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 

5 Xaa2 is Ser or Hhr. 

27. Hie heterologous fusion protein of Claim 25 wherein any 
three regions of regions 1 through 14 comprise the sequence 
Ash xaal Xaa2 wherein Xaal is any amino acid except Pro and 

10 Xaa2 is Ser or Thr. 

28. The heterologous fusion protein of Claim 25 wherein any 
four regions of regions 1 through 14 comprise the sequence 
Asn Xaal Xaa2 wherein Xaal is any amino acid except Pro and 



M 15 Xaa2 is Ser or Thr 

^ 29. The heterologous fusion protein of Claim 25 wherein the 

*^ hyperglycosylated 6-CSP analog is selected from the group 

y consisting of: 

S 20 a) G-CSF[A37N,Y39T1 

^ b) G-CSF[P57V,W58N,P60T] 

rU c) G-CSF[P60N,S62T] 

d) G-CSP[S63N,P65T] 

e) G-CSF(Q67N,L69T] 
25 f) 6-CSP [E93N, 195T] 

g) 6-CSP [T133N,Q135T] 

h> Q-CSP[A141N,A143T] 

i) G-CSP [A37N, Y39T, P57V,W58N, P60T] 

j) G-CSF[A37N,Y39T,P60N,S62Tl 
30 k) G-CSP [A37N/Y39T,S63N,P65T1 

1) 6-CSF[A37N,Y39T,Q67N,L69Tl 

m) G-CSF[A37N,Y39T,B93N, I95T1 

n) G-CSF[A37N/Y39T,T133N,G135T] 

o) G-CSF[A37N,Y39T,A141N,A143T1 
35 p) G-CSF[A37N,Y39T,P57V,W58N,P60T,S63N,P65T] 

q) Q-CSF[A37N,Y39T,P57V,W58N,P60T,Q67N,L69T] 
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r ) G-CSF [A37N. Y39T, S63N, P65T, E93N, I95T1 

30. The heterologous fusion protein of Claim 29 wherein the 
hyperglycosylated G-CSP analog is G- 

CSF tA37N, Y39T, P57V, W58N, P60T, Q67N, L69T] 

31. The heterologous fusion protein of Claim 29 wherein the 
hyperglycosylated G-CSF analog is G- 

CSF tA37N, Y39T, S63N, P65T, E93N, I95T1 



32. The heterologous fusion protein of any one of Claims 22 

^ through 31 wherein the polypeptide is the Fc portion of an 

y Ig selected from the group consisting of: IgGl, IgG2, IgG3, 

S lgG4, igE, igA, igD, or IgM. 

4 33. The heterologous fusion protein of Claim 32 wherein the 

? polypeptide is the Fc portion of an Ig selected from the 

?3 group consisting of: IgGl. IgG2, lgG3, and IgG4. 

in 20 34. The heterologous fusion protein of Claim 33 wherein the 
II polypeptide is the Fc portion of an IgGl immunoglobulin. 

35. The heterologous fusion protein of Claim 33 wherein the 
polypeptide is the Fc portion of an IgG4 

2 5 immunoglobul in . 

36. The heterologous fusion protein of any one of Claims 22 
through 35 wherein the Fc portion is a human IgG protein. 

30 37. The heterologous fusion protein of any one of Claims 22 
. through 36 wherein the Fc portion comprises hinge, CH2, and 
CH3 domains. 

38. The heterologous fusion protein of Claim 34 wherein the 
35 polypeptide has the sequ«ice of SEQ ID NO: 33. 



25 
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39. A polynucleotide encoding a heterologous fusion protein 
of Claim 34, wherein the polynucleotide comprises SEQ ID NO: 
22. 

5 40. A polynucleotide encoding a heterologous fusion protein 
of any one of Claims 22 through 38. 

41. A vector ccmiprising the polynucleotide of Claim 12, 13, 
14, 39 and 40. 

10 

42. A host cell coinprising the vector of Claim 41. 

a 

y 43. A host cell expressing at least one heterologous fusion 

^ protein of any one of Claims 1 through 11, 15, 16, and 22 

u3 15 through 38. 



44. The host cell of Claim 43 wherein said host cell is a 
CHO cell. 



in 20 45. A process for producing a heterologous fusion protein 
^ comprising the steps of transcribing and translating a 

polynucleotide of Claim 40 under conditions wherein the 
heterologous fusion protein is expressed in detectable 
amounts • 



46. A method for increasing neutrophil levels in a mammal 
comprising the administration of a therapeutically effective 
axQovmt of the heterologous fusion protein of any one of 
Claims 27 through 36. 

47. The use of the heterologous fusion protein as claimed 
in any one of Claims 22 through 29 for the manufacture of a 
medicament for the treatment of patients with insufficient 
circulating neutrophil levels. 



35 
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48 • Use of a heterologous fusion protein of any one of 
Claims 22 through 29 as a medicament. 

49 . Use of a heterologous fusion protein of any one of 
Claims 22 through 29 for the treatment of patients with 
insufficient circulating neutrophil levels* 

50 • A pharmaceutical formulation adapted for the treatment 
of patients with insufficient neutrophil levels comprising a 
heterologous fusion protein of any one of Claims 22 through 
29. 

51. A heterologous fusion protein as hereinbefore described 
with reference to any one of the Examples. 
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Abstract 

Hie present invention encompasses heterologous fusion 
proteins comprising a hyperglycsoylated G-CSF analog fused 
to proteins such as alb\amin and the Fc portion of an 
immunoglobulin which act to extend the in vivo half-life of 
the protein coinpared to native G-CSF. These fusion proteins 
are particularly suited for the treatment of conditions 
treatable by stimulation of circulating neutrophils, such as 
after chemotherapy regimens or in chronic congenital 
neutropenia* 
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Fig. 1 
1/1 

IS 10 15 

•Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 

20 25 30 

Cys Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 

35 Rl 40 4S 

Glu Lys Leu CysViaa-Xafi-Zasf Lys Leu Cys His Pro Glu Glu Leu Val 

50 55 »3__fifl V ^5 

Leu Leu Gly His Ser Leu Gly He Xaa Xaaf Xaa «aa Xaa^ XaajflCaa Xaa XaaySer 



R9 70 75 80 

vXaa Xaa Xaa/ Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 

85 90 Ria, 



Glv Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala LeuyXaa Xaa /ka^ Ser Xaa^ 

Rll 

100 105 110 

Xaa Xaa Gly Pro 'Thx Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 

115 120 125 

Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

130 R13 135 140 R14 

Ala Leu Gin Pro iXaa Xaa Xag y Ala Met Pro Ala PhevXag^_Sas.JSa3/ Phe 

145 150 155 160 

Gin Arg Arg Ala Gly Gly VslL Leu Val Ala Ser His Leu Aln Ser Phe 

165 170 
Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
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SEQUENCE LISTING 



<120> 



<130> 



Cn <1€0> 

a 

M <170> 



^ <210> 

a • 
iiU <2ii> 
m 

yt <2i2> 

a 

Hi <213> 



<220> 



<223> 
<220> 



<221> 



<222> 



<223> 



<220> 



Beals, iTohn 
luchibhotla, Uma 

HETEROLCXSOUS G-CSF FUSION PROTEINS 
P- 1564 8 



Patentin version 3 . 1 

1 

174 
PRT 

Artificial Sequence 

synthetic construct 
MISC FEATURE 



(17) (17) 

Xaa at position 17 is Cys, Ala, Leu,. Ser, or Glu; 
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^^i21> MISC_FEATaRE 
<222> (37) . . (37) 

<223> Xaa at position 37 is Ala or Asn; 



<220> 

<221> MISC_FEATURE 

<222> (38) . . (38) 

<223> Xaa at position 38 is Thr, or any other amino acid exept Pro; 



^ <220> 

^ <221> MISC_PEATURE 

<222> (39) . . (39) 

^ <223> Xaa at position 39 is Tyr, Thr, or Ser; 



m <220> 

w 

m 

p <221> MISC_FEATURE 

<222> (57) . . (57) 

<223> Xaa at position 57 is Pro or Val; 



<220> 

<22 1 > M1SC_FEATURE 

<222> (58).. (58) 

<223> Xaa at position 58 is Trp or Asn; 



<220> 
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^|^1> MISC_FEATURE 
<222> (59) . . (59) 

<223> Xaa at position 59 is Ala or any other amino acid except Pro; 



<220> 

<221> MISC_PEATURE 

<222> (60) . . (60) 

<223> Xaa at position 60 is Pro/ Thr, Asn, or Ser; 



^:|<220> 

^<221> MISC^FEATURE 

fi<222> (61) . . (61) 

)§ 

^p<223> Xaa at position 61 is Leu, or any other amino acid except 
ggPro; 

g3<220> 

m 

p <2 2 1 > MI SC^FEATURE 

<222> (62) . . (62) 

<223> Xaa at position 62 is Ser or Thr; 



<220> 

<221> MISC_FEATURE 

<:222> (63) . . (63) 

<223> Xaa at position 63 Ser or Asn; 



<220> 
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il^l> MISC_FEATURE 

<222> (64).. (64) 

<223> Xaa at position 64 is Cys or any other amino acid except Pro; 
<220> 

<221> MISC_FEATORB 

<222> (65) . . (65) 

<223> Xaa at position 65 is Pro, Ser, or Thr; 



<220> 

fS3 <221> MISC_FEATURE 

^ <222> (66) . . (66) 

«B <223> Xaa at position 66 is Ser or Thr; 

m 

<220> 



<221> MISC FEATURE 



|3 <222> (67) . . (67) 

<223> Xaa at position 67 is Gin or Asn; 



<220> 

<221> MISC_FBATURE 

<222> (68) • . (68) 

<223> Xaa at position 68 is Ala or any other amino acid except Proj 
<220> • 

<221> MISC_FEATURE 
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1^2> (69).. (69) 

<223> Xaa at position 69 is Leu, Thr, or Ser; 



<220> 

<221> MISC_FEATURE 
<222> (93).. (93) 

<223> Xaa at position 93 is 61u or Asn; 
<220> 

|'<221> MISC FEATURE 
P 

y<222> (94).. (94) 

^<223> Xaa at position 94 is Gly or any other amino acid except Pro; 
L<220> 

Q 

y<221> MISC_FEATORE 
^<222> (95).. (95) 

m 

*^<223> Xaa at position 95 is lie, Asn, Ser, or Thr; 



<220> 

<221> MISC_PEATUKE 

<222> (97).. (97) 

<223> Xaa at position 97 is Pro, Ser, Thr, or Asn; 



<220> 

<221> MISC FEATURE 
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^2> (133).. (133) 

<223> Xaa at position 133 is Thr or Asn; 



<220> 

<221> MISC_PEATURJB 
<222> (134) . . (134) 

<223> Xaa at position 134 is Gin or any other amino acid except 
Pro; 



•<:220> 

|<221> MISC FEATURE 



l(i<222> (135).. (135) 

l*^<223> Xaa at position 135 is Gly, Ser, or Thr; 



* <220> 

W<221> MISC FEATURE 

U1 <222> (141) . . (141) 

W<223> Xaa at position 141 is Ala or Asn; 



<220> 

<221> MISC_FEATURE 

<222> (142) . . (142) 

<223> Xaa at position 142 is Ser or any other amino acid except 
Pro; 



<220> 

<221> MISC^FEATURB 
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ifc2> (143) . . (143) 

<223> Xaa at position 143 is Ala, Ser, or Thr. 
<400> 1 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
1 5 10 15 



Xaa Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
20 25 30 



Glu Lys Leu Cys Xaa Xaa Xaa Lys Leu Cys His Pro Glu Glu Leu Val 
35 40 45 

Q 

ULeu Leu Gly His Ser Leu Gly He Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
P 50 55 60 

^Xaa Xaa Xaa Xaa Xaa Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
^SS 70 75 80 

^Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Xaa Xaa Xaa Ser 
M 85 90 95 

m 

e 

^'^Xaa Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
100 105 110 



Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
115 120 125 



Ala Leu Gin Pro Xaa Xaa Xaa Ala Met Pro Ala Phe Xaa Xaa Xaa Phe 
130 135 140 



Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 

165 170 



Page 7 




P-15648.ST25.txt 




<210> 2 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 2 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 



ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 



cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 



cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 
23240 

ggctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 

& 

jytctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 

360 

gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 

. acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 

Page 8 



P-15648.ST25.txt 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
, 780 

aaccagaccg ccatgccggc cttcgcctct gctttccggg acgtcgggtt ggtctggcgg 
840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 ■ 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 

0 

W<210> 3 

^<2H> 1044 

^<212> DNA 
gi 

P<213> Artificial Sequence 
W 

|<220> 
fU 

<223> synthetic construct 
<400> 3 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 
240 

Gtcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 
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•ctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 

gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 



acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
|720 

u 

Saaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
Jj780 

^acccagggtg ccatgccggc cttcaactct accttccggg acgtcgggtg ggtcccacgg 
^840 

@ 

^tacggccgga agttgagatg gaagcagcgc cgggcaggag gggtcctggt tgcctcccat 

y=|ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
p960 

ru 

ctggaggtgt: cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 



<210> 4 
<211> 1044 
<212> DNA ' 

<213> Artificial Seqpience 
<220> 
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synthetic construct 
<400> 4 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
240 

ctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

^tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 
1360 



r«gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 

icaactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
!l 480 

Ijggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 

^atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
jSJsoo 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 
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ggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 



<210> 5 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



^ <223> synthetic construct 
ffl 

^ <400> 5 

^ acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 

I 60 

m 

^ ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 

w 

g»l cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 

h 

cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 
240 

ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc taacactagc agctgcgacg agcctgtgac agacccgtag 
360 

gggacccgat tggactcctc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
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acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 

? 

I^ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 

^«cagaattccg tggaacgggt cggg 
IS 1044 



^<210> 6 
y<211> 1044 
M<212> DNA 



^ <213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 6 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 
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cgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 



ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc tcccctgagc aattgcgacg agcctgtgac agacccgtag 
360 

gggacccgag gggactcgtt aacgaccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gctggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 



Seoo 



atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 

y 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 

Jg-Ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
^720 

m aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
W780 

0 

i|j acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

rv 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 

<210> 7 
<211> 1044 
<212> DNA 
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<213> Artificial Sequence 



<220> 



<223> synthetic construct 
<400> 7 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
. 120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

^ cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 
S 240 

^ ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 
M 300 

tctctgggca tcgttaacgc taccctgagc agct^gacg agcctgtgac agacccgtag 
^ 360 

» 

^ caattgcgat gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
U 420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ru 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 



atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 



acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 



ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 



aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 
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acggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 



<210> 8 

<211> 1044 

<212> DNA 

^ <213> Artificial Sequence 

^ <220> 

=^ 

^ <223> synthetic construct 
13 <400> 8 

y acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
^ 60 

^ ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 

fyi20 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtgccacc tacaagctgt gccaccccga ggagctggtg 
240 

ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 
360 

gggacccgag gggactcgtc gacgcccagc aacgccaccc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcgtt gcggtgggtc gaccgtccga cgaactcggt tgaggtatcg 
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ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

siaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

St. 

P tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
W 900 

gn 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
»fi 960 

f- 

i& ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 

p 

y cagaattccg tggaacgggt cggg 
g 1044 

a 

<210> 9 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 9 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 
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iM^acggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtg*tgccacc tacaagctgt gccaccccga ggagctggtg 
240 

ctcttcgaca cacggtggat gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 
360 

gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

W ggccttttcc tctaccaggg gctcctgcag gccctgaacg ggacctcccc ggaaaaggag 
fl 540 

^tgatccccg aggacgtccg ggacttgccc tggaggcccg agttgggtcc caccttggac 

f' 600 

m 

^ acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 

§3 660 

W 

O ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
m 720 

fy aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatagcg 
1020 

cagaattccg tggaacgggt cggg 
1044 
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.0> 10 
<211> 1044 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 10 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

Sggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 

ffl 

^cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 

Scgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
^240 

yctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 

gtctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 
^260 

gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 

acactgcagc tggacgtcgc cgacgggctc aacccagggt: ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 
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icggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

aaccagaccg ccatgccggc cttcgcctct gctttccggg acgtcgggtt ggtctggcgg 
840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 - 

cagaattccg tggaacgggt cggg 
1044 



J^<210> 11 

^<211> 1044 

j|<212> DNA 

y*<213> Artificial Sequence 

h 



J^<220> 



ru 



<223> synthetic construct 
<400> 11 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
240 

ctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 
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gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

g aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 

W 780 

61 

jt* acccagggtg ccatgccggc cttcaactct accttccggg acgtcgggtg ggtcccacgg 
^ 840 • 



tacggccgga agttgagatg gaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
R! 900 

q 

Wctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtct:cgaag 
a 960 

in 

^5 ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 



<210> 12 

<211> 1044 

<212> DNA 

<213> Artificial Sequence 



<220> 
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^1^3 > synthetic construct 
<400> 12 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 



cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 



cgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
240 



ctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 



tctctgggca tcgttaacgc taccctgagc agctgcgacg agcctgtgac agacccgtag 

fi 360 

W caattgcgat gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
@i420 

^Icaactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
#480 

m 

^ ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 

53 540 

W 

©atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 

in 600 

a 

^^acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 



aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 



acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 



tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 



ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 



Page 22 



P-15648.ST25,txt 

•ggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
20 



cagaattccg tggaacgggt cggg 
1044 



•<21p> 13 

<2H> 1044 

<212> DNA 

<213> Artificial Seq[uence 
<220> 

^ <223> sjrnthetic construct 
lii 

3« <400> 13 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 

^ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

|;y cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
^180 

in ^ ^ 

gcgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
^^cttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 

tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 

3 SO 

gggacccgag gggactcgtc gacgcccagc aacgccaccc agctggcagg ctgcttgagc 

4 2 0 

caactccata gcgggtcgtt gcggtgggtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 
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actgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 * 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

PR 

£ ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
H 1020 

1044 



cagaattccg tggaacgggt cggg 



J <210> 14 
® <211> 1044 
II <212> DNA 

j!y<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 14 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
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ctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc tcccctgagc agctgcgacg agcctgtgac agacccgtag 
360 

' gggacccgag gggactcgtc gacgcccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gcgggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggaacg gtaccggtcc caccttggac 

l^acactgcagc tggacgtcgc cgacttgcca tggccagggt ggaacctgtg tgacgtcgac 

'oCtgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcfccct 
15 720 • 



aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
1780 

w 

ggacccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 

0 

jnjtacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
^^900 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 



<210> 15 
<211> 1044 
• <212> DNA 
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13 > Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 15 

acccccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
60 

ggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 
120 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
Sn 240 

l^ctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 
g300 

mO 

jptctctgggca tcgttaacgc taccctgagc agctgcgacg agcctgtgac agacccgtag 

pcaattgcgat gggactcgtc gacgcccagc aacgccaccc agctggcagg ctgcttgagc 

P 

i^jrj caactccata gcgggtcgtt gcggtgggtc gaccgtccga cgaactcggt tgaggtatcg 
gg480 

fU 

ggccttttcc tctaccaggg gctcctgcag gccctggaag ggatctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggaccttccc tagaggcccg agttgggtcc caccttggac 
600 

. acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 
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ij^ggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 

ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
960 

ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
1020 

cagaattccg tggaacgggt cggg 
1044 



<210> 16 

<211> 1044 

<212> DNA 

^ <213> Artificial Sequence . ~ 

y 

^ <220> 

^ <223> synthetic construct 
* <400> 16 

Id ^c°°ccctgg gccctgccag ctccctgccc cagagcttcc tgctcaagtg gggggacccg 
p ^° 

iSggacggtcga gggacggggt ctcgaaggac gagttcgcct tagagcaagt gaggaagatc 

cagggcgatg gcgcagcgct ccagcggaat ctcgttcact ccttctaggt cccgctaccg 
180 

cgtcgcgagg tcgagaagct gtgtaacacc accaagctgt gccaccccga ggagctggtg 
240 

ctcttcgaca cattgtggtg gttcgacacg gtggggctcc tcgaccacct gctcggacac 
300 

tctctgggca tcccctgggc tcccctgagc aattgcgacg agcctgtgac agacccgtag 
360 

gggacccgag gggactcgtt aacgaccagc caggccctgc agctggcagg ctgcttgagc 
420 

caactccata gctggtcggt ccgggacgtc gaccgtccga cgaactcggt tgaggtatcg 
480 
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^^ccttttcc tctaccaggg gctcctgcag gccctgaacg ggacctcccc ggaaaaggag 
540 

atggtccccg aggacgtccg ggacttgccc tggaggcccg agttgggtcc caccttggac 
600 

acactgcagc tggacgtcgc cgacgggctc aacccagggt ggaacctgtg tgacgtcgac 
660 

ctgcagcggc tgtttgccac caccatctgg cagcagatgg aagaactggg aatggcccct 
720 

aaacggtggt ggtagaccgt cgtctacctt cttgaccctt accggggagc cctgcagccc 
780 

acccagggtg ccatgccggc cttcgcctct gctttccggg acgtcgggtg ggtcccacgg 
840 

ffl tacggccgga agcggagacg aaagcagcgc cgggcaggag gggtcctggt tgcctcccat 
p 900 

m 

tV ctgcagagct tcgtcgcggc ccgtcctccc caggaccaac ggagggtaga cgtctcgaag 
Z 960 

jp 

% ctggaggtgt cgtaccgcgt cttaaggcac cttgcccagc ccgacctcca cagcatggcg 
J* 1020 

€3 

I I cagaattccg tggaacgggt cggg 
a 1044 

m 

g<210> 17 

<211> 1762 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 17 

gatgcgcaca agagtgaggt tgctcatcgg tttaaagatt tgggagaaga aaatttcaaa 
60 

gccttggtgt tgattgcctt tgctcagtat cttcagcagt gtccatttga agatcatgta 
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m 



aaattagtga atgaagtaac tgaatttgca aaaacatgtg ttgctgatga gtcagctgaa 
180 

aattgtgaca aatcacttca tacccttttt ggagacaaat tatgcacagt tgcaactctt 
240 

cgtgaaacct atggtgaaat ggctgactgc tgtgcaaaac aagaacctga gagaaatgaa 
300 

tgcttcttgc aacacaaaga tgacaaccca aacctccccc gattggtgag accagaggtt 
360 

gatgtgatgt gcaetgcttt tcatgacaat gaagagacat ttttgaaaaa atacttatat 
420 

gaaattgcca gaagacatcc ttacttttat gccccggaac tccttttctt tgctaaaagg 

CP 

R tataaagctg cttttacaga atgttgccaa gctgctgata aagctgcctg cctgttgcca 

W540 

^ aagctcgatg aacttcggga tgaagggaag gcttcgtctg ccaaacagag actcaagtgt 
J 600 

^ gccagtctcc aaaaatttgg agaaagagct ttcaaagcat gggcagtagc tcgcctgagc 

w • 

^cagagatttc ccaaagctga gtttgcagaa gtttccaagt tagtgacaga tcttaccaaa 

S-'^o 

6 

stiigtccacacgg aatgctgcca tggagatctg cttgaatgtg ctgatgacag ggcggacctt 
* 780 

gccaagtiata tctgtgaaaa tcaagattcg atctccagtia aactigaagga atgctgtgaa 
840 

aaacctctgt tggaaaaatc ccactgcatt gccgaagtgg aaaatgatga gatgcctgct 
900 

gacttgcctt cattagctgc tgattttgtt gaaagtaagg atgtttgcaa aaactatgct 
960 

gaggcaaagg atgtcttcct gggcatgttt ttgtatgaat atgcaagaag gcatcctgat 
1020 

tactctgtcg tgctgctgct gagacttgcc aagacatatg aaaccactct agagaagtgc 
1080 

tgtgccgctg cagatcctca tgaatgctat gccaaagtgt tcgatgaatt taaacctctt 
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gtggaagagc ctcagaattt aatcaaacaa aattgtgagc tttttgagca gcttggagag 
1200 

tacaaattcc agaatgcgct attagttcgt tacaccaaga aagtacccca agtgtcaact 
1260 

ccaactcttg tagaggtctc aagaaaccta ggaaaagtgg gcagcaaatg ttgtaaacat 
1320 

cctgaagcaa aaagaatgcc ctgtgcagaa gactatctat ccgtggtcct gaaccagtta 
1380 

tgtgtgttgc atgagaaaac gccagtaagt gacagagtca ccaaatgctg cacagaatcc 
1440 

ttggtgaaca ggcgaccatg cttttcagct ctggaagtcg atgaaacata cgttcccaaa 

Q 

Qgagtttaatg ctgaaacatt caccttccat gcagatatat gcacactttc tgagaaggag 

agacaaatca agaaacaaac tgcacttgtt gagctcgtga aacacaagcc caaggcaaca 
1^1620 

^ aaagagcaac tgaaagctgt tatggatgat ttcgcagctt ttgtagagaa gtgctgcaag 
|ig 1680 

gggctgacgata aggagacctg ctttgccgag gagggtaaaa aacttgttgc tgcaagtcaa 

6 

eygctgccttag gcttataatg ac 
1762 



<210> 18 

<211> 232 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 18 
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^fjjk Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro 

15 10 15 



Ala Pro Glu Lys Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
20 25 30 



Lys Asp Thr Lys Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val 
35 40 45 

Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val 
50 55 60 

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
^ 65 70 75 80 

^Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin 
g 85 90 95 



S3 



Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 
100 105 110 



Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly Gin Pro 
^ 115 120 125 

0' 

'"'Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr 
130 135 140 

Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser 
145 150 155 160 

Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 

165 170 175 

Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 
180 185 190 

Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 
195 200 205 
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Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
210 215 220 



Ser Leu Ser Leu Ser Pro Gly Lys 
225 230 



<210> 19 
<211> 229 

<212> PRT ': 
<213> Artificial Sec[ueiice 

CP 

g<220> • 

<223> synthetic contruct 
^ <400> 19 

g; Glu Ser Lys Tyr Gly Pro Pro Cys Pro Pro Cys Pro Ala Pro Glu Phe 
ffl 1 5 10 15 



\^ Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr 
ig 20 25 30 

ry 

Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val 
35 40 45 

Ser Gin Glu Asp Pro Glu Val Gin Phe Asn Trp Tyr Val Asp Gly Val 
50 55 60 

Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Phe Asn Ser 
65 70 75 80 

Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu 

85 90 95 



Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser 
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100 105 110 



•Ser lie Qlu Lys Thr lie Ser Lys Ala Lys Gly Gin Pro Arg Qlu Pro 
115 120 125 



Sin Val Tyr Thr Leu Pro Pro Ser Gin Glu Glu Met Thr Lys Asn Gin 
130 135 140 



Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie Ala 
145 150 155 160 

Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr 

165 170 175 

©Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu 
Q 180 185 190 

5 Thr Val Asp Lys Ser Arg Trp Gin Glu Gly Asn Val Phe Ser Cys Ser 
^ 195 200 205 

gVal Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser 
y 210 215 220 

@ 

^Leu Ser Leu Gly Lys 



ry 



225 

<210> 20 

<211> 585 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 20 
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il Ala His Arg Phc 
5 10 15 



Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu 61y Glu 



Glu Asn Phe Lys Ala Leu Val Leu lie Ala Phe Ala Gin Tyr Leu Gin 
20 . 25 30 



Gin Cys Pro Phe Glu Asp His Val Lys Leu Val Asn Glu Val Thr Glu 
35 40 45 



Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu Asn Cys Asp Lys 
50 55 60 



Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu 
65 70 75 80 

m 

yArg Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pro 
fiR 85 90 95 



J Glu Arg Asn Glu Cys Phe Leu Gin His Lys Asp Asp Asn Pro Asn Leu 
gg 100 105 110 

yjPro Arg Leu Val Arg Pro Glu Val Asp Val Met Cys Thr Ala Phe His 
p 115 120 125 

ill 

f^Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr Glu lie Ala Arg 
130 135 140 

Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg 
145 150 155 160 

Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Ala 

165 170 175 

Cys Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser 
180 185 190 

Ser Ala Lys Gin Arg Leu Lys Cys Ala Ser Leu Gin Lys Phe Gly Glu 
195 200 205 
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Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser Gin Arg Phe Pro 
210 215 220 

Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys 
225 230 235 240 

Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp 

245 250 255 

Arg Ala Asp Leu Ala Lys Tyr lie Cys Glu Asn Gin Asp Ser lie Ser 
260 265 - 270 

Ser Lys Leu Lys Glu Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His 
^ 275 280 285 

^Cys lie Ala Glu Val Glu Asn Asp Glu Met Pro Ala Asp Leu Pro Ser 
^ 290 295 300 

IjgLeu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala 
^ 305 310 315 320 

a 
w 

pGlu Ala Lys Asp Val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg 
in . 325 330 335 

Arg His Pro Asp Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr 
340 345 350 

Tyr Glu Thr Thr Leu Glu Lys Cys Cys Ala Ala Ala Asp Pro His Glu 
355 360 365 

Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu Val Glu Glu Pro 
370 375 380 

Gin Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Glu Asn Leu Gly Glu 
385 390 395 400 

Tyr Lys Phe Gin Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro 

405 410 415 
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Gin Val Ser Thr Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys 
420 425 430 



Val Gly Ser Lys Cys Cys Lys His Pro Glu Ala Lys Arg Met Pro Cys 
435 440 445 



Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gin Leu Cys Val Leu His 
450 455 460 

Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser 
465 470 475 480 

Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr 
Sn 485 490 495 

w 

ff^Tyr Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp 
500 505 510 

PziB Cys Thr Leu Ser Glu Lys Glu Arg Gin lie Lys Lys Gin Thr Ala 
515 520 525 

y 

K^Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr Lys Glu Gin Leu 
^ 530 535 540 

Lys Ala Val Met Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys 
545 550 555 560 

Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val 

565 570 575 

Ala Ala Ser Gin Ala Ala Leu Gly Leu 
580 585 



<210> 21 
<211> 703 
<212> DNA 

Page 36 




P-15648, ST25.txt 

Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 21 

gagcccaaat cttgtgacaa aactcacaca tgcccaccgt gcccagcacc tgaactcctg 
60 

aggggaccgt cagtcttcct cttcccccca aaacccaagg acaccctcat gatctcccgg 
120 

acccctgagg tcacatgcgt ggtggtggac gtgagccacg aagaccctga ggtcaagttc 
180 



g aactggtacg tggacggcgt ggaggtgcat aatgccaaga caaagccgcg ggaggagcag 
g240 

sn 

1,^ tacaacagca cgtaccgtgt ggtcagcgtc ctcaccgtcc tgcaccagga ctggctgaat 

gfl ggcaaggagt acaagtgcaa ggtctccaac aaagccctcc cagcccccat cgagaaaacc 

yatctccaaag ccaaagggca gccccgagaa ccacaggtgt acaccctgcc cccatcccgg 

in 

fflgaggagatga ccaagaacca ggtcagcctg acctgcctgg tcaaaggctt ctatcccagc 

gacatcgccg tggagtggga gagcaatggg cagccggaga acaactacaa gaccacgcct 
540 

cccgtgctgg actccgacgg ctccttcttc ctctatagca agctcaccgt ggacaagagc 
600 

aggtggcagc aggggaacgt cttctcatgc tccgtgatgc atgaggctct gcacaaccac 
660 

tacacgcaga agagcctctc cctgtctccg ggtaaatgat agt 
703 

<210> 22 
<211> 981 
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||||12> DNA 

<213> Artificial Sequence 




<220> 

<223> synthetic construct 
<400> 22 

tccaccaagg gcccatcggt cttcccgcta gcgccctgct ccaggagcac ctccgagagc 
60 

acagccgccc tgggctgcct ggtcaaggac tacttccccg aaccggtgac ggtgtcgtgg 
120 

aactcaggcg ccctgaccag cggcgtgcac accttcccgg ctgtcctaca gtcctcagga 

^ ctctactccc tcagcagcgt ggtgaccgtg ccctccagca gcttgggcac gaagacctac 
m 240 

h& 

'gacctgcaacg tagatcacaa gcccagcaac accaaggtgg acaagagagt tgagtccaaa 
^|300 

^ tatggtcccc catgcccacc ctgcccagca cctgagttcc tggggggacc atcagtcttc 
^360 

ggctgttccccc caaaacccaa ggacactctc atgatctccc ggacccctga ggtcacgtgc 

pijgtggtggtgg acgtgagcca ggaagacccc gaggtccagt tcaactggta cgtggatggc 
480 

gtggaggtgc ataatgccaa gacaaagccg cgggaggagc agttcaacag cacgtaccgt 
540 

gtggtcagcg tcctcaccgt cctgcaccag gactggctga acggcaagga gtacaagtgc 
600 

^aggtctcca acaaaggcct cccgtcctcc atcgagaaaa ccatctccaa agccaaaggg 

cagccccgag agccacaggt gtacaccctg cccccatccc aggaggagat gaccaagaac 
720 

caggtcagcc tgacctgcct ggtcaaaggc ttctacccca gcgacatcgc cgtggagtgg 
78 0 

gagagcaatg ggcagccgga gaacaactac aagaccacgc ctcccgtgct ggactccgac 
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ggctccttct tcctctacag caggctaacc gtggacaaga gcaggtggca ggaggggaat 
900 

gtcttctcat gctccgtgat gcatgaggct ctgcacaacc actacacaca gaagagcctc 
960 

tccctgtctc tgggtaaatg a 
981 



<210> 23 

<211> 406 

<212> PRT 

^ <213> Artificial Sequence 

a 

^<220> 

^<223> synthetic construct 
„ <400> 23 

yThr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
gl 5 10 15 

m 

yyAla Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin 
20 25 30 

Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
35 40 45 

Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 

Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 

85 90 95 
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Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
100 105 110 



Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Glu Pro 

165 170 175 

a 

loLys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu 
Yl 180 185 190 



% Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp 
r 195 200 205 

u 

|t«Thr Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp 
^ 210 215 220 

S ' , 

Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly 
225 230 235 240 



Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn 

245 250 255 



Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp 
260 265 270 



Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro 
275 280 285 



Ala Pro He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro Arg Glu 
290 295 300 
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Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn 
305 310 315 320 



Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie 

325 330 335 



Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr 
340 345 350 



Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys 
355 360 365 



Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys 
^ 370 375 380 

w . 

2 Ser Val Met His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu 
385 390 395 400 

^ Ser Leu Ser Pro Gly Lys 

|<210> 24 

Jy<211> 403 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 24 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
1 5 10 15 



Ala Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
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20 25 30 



. Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
35 40 45 



Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 



Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 

85 90 95 

^ Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
Q 100 105 110 

5 Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 

120 125 

LAla Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
2 130 135 140 

a 
Iff 

&Gln Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
fyl45 150 155 160 

Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Glu Ser 

165 170 175 

Lys Tyr Gly Pro Pro Cys Pro Ser Cys Pro Ala Pro Glu Phe Leu Gly 
180 185 190 

Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr Leu Met 
195 200 205 

lie Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val Ser Gin 
210 215 220 
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e Asn Trp Tyr Va] 
230 235 240 



Asp Pro Glu Val Gin Phe Asn Trp Tyr Val Asp Gly Val Glu Val 



His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Phe Asn Ser Thr Tyr 

245 250 255 



Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu Asn Gly 
260 265 270 



Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Gly Leu Pro Ser Ser lie 
275 280 285 



Glu Lys Thr lie Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro Gin Val 
290 295 300 

QTyr Thr Leu Pro Pro Ser Gin Glu Glu Met Thr Lys Asn Gin Val Ser 
W305 310 315 320 

f 

M 

^Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie Ala Val Glu 
J 325 330 335 

QTrp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro 
W 340 345 350 

dVal Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Arg Leu Thr Val 
fy 355 360 365 



Asp Lys Ser Arg Trp Gin Glu Gly Asn Val Phe Ser Cys Ser Val Met 
370 375 380 



His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser 
385 390 395 400 

Leu Gly Lys 



<210> 25 
<211> 500 
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L2> PRT 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 25 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
1 . 5 10 15 



Ala Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
m 20 25 ■ 30 

a 

y Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
ft 35 40 45 



% Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
^ 50 55 60 

a 
y 

g Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
Ij 65 70 75 80 

h 

Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 

85 90 95 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
100 105 110 



Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Tkrg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 
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Leu Glu Val Ser Tyx Axg Val Leu Arg His Leu Ala Gin Pro Gly Gly 

165 170 175 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp Ala His 
180 185 190 



Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe 
195 200 205 



Lys Ala Leu Val Leu lie Ala Phe Ala Gin Tyr Leu Gin Gin Cys Pro 
210 215 220 



Phe Glu Asp His Val Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys 
225 230 235 240 



Thr Cys Val Ala Asp Glu Ser Ala Glu Asn Cys Asp Lys Ser Leu His 

245 250 255 



gPThr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu Arg Glu Thr 
p 260 265 270 

MTyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gin Glu Pro Glu Arg Asn 
275 280 285 

» Glu Cys Phe Leu Gin His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu 

d 290 295 300 

yiVal Arg Pro Glu Val Asp Val Met Cys Thr Ala Phe His Asp Asn Glu 

§3305 310 315 320 



Glu Thr Phe Leu Lys Lys Tyr Leu Tyr Glu lie Ala Arg Arg His Pro 

325 330 335 



Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg Tyr Lys Ala 
340 345 350 



Ala Phe Thr Glu Cys Cys Gin Ala Ala Asp Lys Ala Ala Cys Leu Leu 
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355 360 365 



Pro Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys 
370 375 380 



Gin Arg Leu Lys Cys Ala Ser Leu Gin Lys Phe Gly Glu Arg Ala Phe 
385 390 395 400 



Lys Ala Trp Ala Val Ala Arg Leu Ser Gin Arg Phe Pro Lys Ala Glu 

405 410 415 



Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys Val His Thr 
420 425 430 

ffi Glu . Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp 
2 435 440 445 

en 

J Leu Ala Lys Tyr lie Cys Glu Asn Gin Asp Ser lie Ser Ser Lys Leu 
^ 450 455 460 

Lys Glu Cys Cys Glu Lys Pro Leu Leu Glu Lys Ser His Cys lie Ala 
y465 470 475 480 

^ Glu Val Glu Asn Asp Glu Met Pro Ala Asp Leu Pro Ser Leu Ala Ala 
^ 485 490 495 

Asp Phe Val Glu 
500 



<210> 26 

<211> 69 

<212> DNA 

<213> Artificial Sequence 



<220> 
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HP 
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3> synthetic construct 



<400> 26 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 
60 

atgaagctg 
69 



<210> 27 

<211> 61 

.<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic constrxict 
<400> 27 

<4n ggggcaggga gctggctggg cccagtggag tggcttcctg cactgtccag agtgcactgt 

ii <210> 28 
L <211> 59 



y <212> DNA 

HI 

m 
ru 



0 <213> Artificial Sequence 



<220> 

'<223> synthetic construct 
<400> 28 

ggacagtgca ggaagccact ccactgggcc cagccagctc cctgccccag agcttcctg 
59 



Page 47 



P-15648.ST25.txt 

^I^Q> 29 

<211> 72 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 29 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 
60 

caggaagctc tg 

w. 

^ <210> 30 
^ <211> 69 

<212> DNA 
y <213> Artificial Sequence 

m 

fU <220> 

<223> synthetic construct 
<400> 30 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 
60 

atgaagctg 
69 

. <210> 31 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 

<400> 31 

gctctaaggc cttgagcagg aagctctggg gcagggagct cgctgggccc agtggag 
57 

<210> 32 

<211> 53 
<212> 

^ <213> Artificial Sequence 
^& <220> 

^ <223> synthetic construct 

« <400> 32 

0 gggcccagcg agctccctgc cccagagctt cctgctcaag gccttagagc aag 
^ 53 

Ml 

P <210> 33 

it! 

<211> 72 

-<212> DNA. 

<213> Artificial Sequence 



<220> 

<223> synthetic construct' 
<400> 33 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 
60 

caggaagctc tg 
72 
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<210> 34 

<211> 69 

<212> DNA. 

<213> Artificial Sequence 



<220> 

<223>- synthetic construct 
<400> 34 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 
60 

0 atgaagctg 
UJ 69 

M 

^ <210> 35 

& <211> 61 

s 

P <212> DNA. 

© <213> Artificial Sequence 

yi 
a 
ru 

<220> 

<223> synthetic construct 
<400> 35 

gtccgagcag cactagttcc tcggggtggc acagcttggt ggtgttacac agcttctcct 
60 

g 

61 



<210> 36 
<211> 66 
<212> DMA 
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•l3> Artificial Sequence 



<220> 

<223> synthetic construct 
<40p> 36 

ggcgcagcgc tccaggagaa gctgtgtaac accaccaagc tgtgccaccc cgaggaacta 
60 

gtgctg 
66 

<210> 37 
P <211> 72 
y <212> DNA 

<213> Artificial Sequence 

I 

^ <223> synthetic construct 
|| <400> 37 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 

caggaagctc tg 
72 



<210> 38 

<211> 69 

<212> DNA 

<213> Artificial Sequence 



<220> 
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|23> synthetic construct 
<400> 38 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 
60 

atgaagctg 
69 



<210> 39 

<211> 61 

<212> DNA 

<213> Artificial Sequence 

> 

<220> 

<223> synthetic construct 

\fl^<400> 39 

^ gcccggcgct ggaaagcgct ggcgaaggcc ggcatggcgg tctggttggg ctgcagggca 

m 60 

^g 
y 61 

0<210> 40 

ru 

<211> 60 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 40 

ggcccctgcc ctgcagccca accagaccgc catgccggcc ttcgccagcg ctttccagcg 
60 
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0> 41 



<211> 72 
<212> DHA. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 41- 

gaacctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 
60 

caggaagctc tg 

m 72 

a 

<210> 42 

J <211> 69 

<212> DNA 

y <213> Artificial Seqpience 

in 

p§ <220> 

<223> synthetic construct 
<400> 42 

gtaagcttgc gtcgacgcta gcggcgcgcc gccatggccg gacctgccac ccagagcccc 
60 

atgaagctg 
69 

<210> 43 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
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• 



<220> 

<223> synthetic construct 
<400> 43 

gcccggcgct ggaaggtaga gttgaaggcc ggcatggcac cctgggtggg ctgaagagca 
60 

ggggccat 
68 

<210> 44 
<211> 74 
2 <212> DNA 

tops 

1^ <213> Artificial SeG[uence 

{ 

^ <220>- 
!!^<400> 44 

gggaatggcc cctgctcttc agcccaccca gggtgccatg ccggccttca actctacctt 

d 

' ccagcgccgg gcag 
74 



^ <223> synthetic construct 



<210> 45 

<211> 72 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
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0> 45 

cctcgag gatcctcatt agggctgggc aaggtgcctt aagacgcggt acgacacctc 

60 . 



caggaagctc tg 
72 



<210> 46 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



65 

j^<223> synthetic construct 
^<400> 46 

^gctagcggcg cgccaccatg 
gg<210> 47 

y 

P<211> 33 

in 

Q<212> DNA 

ry 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 47 

gctcagggta gcgttaacga tgcccagaga gtg 
33 

<210> 48 

<211> 30 

<212> DNA 
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.3> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 48 

gggcatcgtt aacgctaccc tgagcagctg 
30 

<210> 49 

<211> 27 

I <212> DNA 

<213> Artificial Secjuence 



m 



f <220> 



<223> synthetic construct 



y <400> 49 

gg gactcgagga tcctcattag ggctggg 

ru 

<:210> 50 

<211> 38 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 50 

gctagcggcg cgccaccatg gccggacctg ccacccag 
38 
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^fc.O> 51 

<211> 45 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 51 

caagcagccg gccagctggg tggcgttgct ggggcagctg ctcag 
45 

P, <210> 52 

ffl <211> 37 

..^ <212> DNA 

^ <213> Artificial Sequence 



ry 



<223> synthetic construct 
<400> 52 

gccccagcaa cgccacccag ctggccggct gcttgag 
37 



<210> 53 

<211> 47 

<212> Dm 

<213> Artificial Sequence 
<220> 
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m 



P-15648.ST25.txt 
3> synthetic construct 



<400> 53 

gactcgagga tcctcattag ggctgggcaa ggtgccttaa gacgcgg 
47 



<210> 54 

<211> 20 

<212> DNA. 

<213> Artificial Sequence 



<220> 

^ <223> synthetic construct 

W 

<400> 54 
|,s gctagcggcg cgccaccatg 
i 20 

m 

^ <210> 55 

14 <211> 27 

Ifl <212> DNA 

nl <213> l^tificial Secpience 



<220> 

<223> synthetic construct 

<400> 55 

ggggcaacta gtcaggttag cccaggg 
27 

<210> 56 

<211> 27 

<212> DNA 
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^^||p.3> Artificial Sequence 



<220> 

<223> synthetic construct 

<400> 56 

gctaacctga ctagttgccc cagccag 
27 

<210> 57 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
^ <220> 

§ <223> synthetic constiixct 

<400> 57 
y gactcgagga tcctcattag ggctggg 
& 27 

m 

fU <210> 58 
<211> 20 
. <212>^ DHA 

<213> Artificial Sequence 



W 



<220> 

<223> synthetic constoruct 
<400> 58 

gctagcggcg cgccaccatg 
20 
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^^0> 59 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 59 

ggtgcaattg ctcaggggag cccag 
25 

§ <210> 60 

^ <211> 23 

^ <212> DNA 

^ <213> Artificial Sequence 

Q <220> 
W 

^ <223> synthetic construct 

ru 

<400> 60 

gcaattgcac cagccaggcc ctg 
23 

<210> 61 

<211> 27 

<212> DNA 

<213> Artificial Secpience 



<220> 
<223> 



synthetic construct 
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'©00> 61 

gactcgagga tcctcattag ggctggg 
27 

<210> 62 
<211> 38 
<212> DNA. 

<213> Artificial Sequence 



<220> 

IP <223> synthetic construct 

S <400> 62 

^ gctagcggcg cgccaccatg gccggacctg ccacccag 

g <210> 63 

<211> 37 

u 

p <212> DNA 

in 

m <213> Artificial Sequence 



<220> 

<223> synthetic constanict 

<400> 63 

ccggactggt cccgttcagg gcctgcagga gcccctg 
37 

<210> 64 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 

<400> 64 

gaacgggacc agtccggagt tgggtcccac cttgg 
35 

<210> 65 

<211> 47 

<212> DNA 

<213> Artificial Sequence 



ffl <220> 

^ <223> synthetic construct 

^ <400> 65 

^ gactcgagga tcctcattag ggctgggcaa ggtgccttaa gacgcgg 
£3 47 

e 

W <210> 66 

ft! <211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 66 

gtcgacgcta gcggcgcgcc accatggccg gacctg 
36 
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